New FM and its Applications
Training & Inference
Research on Explainability
and Other Aspects
Virtual Agent
Physical Agent
To support the company's medium- and long-term development on Artificial General Intelligence (AGI) in term of fundamental research, enhance independent innovation, accelerate the pace of realization of AGI, and at the same time focus on scientific and technological innovation in AGI-related topics to improve the theoretical and technological competitiveness of the enterprise.
Crossing the "Singularity", exploring the unknown, creating a better future, and becoming an outstanding scientific research organization for the future world!
Fellow of AAAI, ACM, SAEng, IEEE, IAPR
New FM and its Applications
Training & Inference
Research on Explainability
and Other Aspects
Virtual Agent
Physical Agent
Reinforcement Learning from Diverse Human Preferences
Wanqi Xue, Bo An, Shuicheng Yan, Zhongwen Xu
IJCAI 2024 Conference
August 2024
Keywords: Reinforcement Learning, Human Preferences, Human Feedback, Rewards
Exploring Diffusion Time-steps for Unsupervised Representation Learning
Zhongqi Yue, Jiankun Wang, Qianru Sun, Lei Ji, Eric I-Chao Chang, Hanwang Zhang
ICLR 2024 Conference
May 2024
Keywords: unsupervised representation learning, diffusion model, representation disentanglement, counterfactual generation
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control
Longtao Zheng, Rundong Wang, Xinrun Wang, Bo An
ICLR 2024 Conference
May 2024
Keywords: AI Agents, Large Language Models, Prompting
True Knowledge Comes from Practice: Aligning Large Language Models with Embodied Environments via Reinforcement Learning
Weihao Tan, Wentao Zhang, Shanqi Liu, Longtao Zheng, Xinrun Wang, Bo An
ICLR 2024 Conference
May 2024
Keywords: Reinforcement Learning, Large Language Models, Parameter-Efficient Fine-Tuning
Enhancing Video-Language Representations with Structural Spatio-Temporal Alignment
Hao Fei; Shengqiong Wu; Meishan Zhang; Min Zhang; Tat-Seng Chua; Shuicheng Yan
IEEE Transactions on Pattern Analysis and Machine Intelligence
April 2024
Keywords: Videos, Semantics, Transformers
Skywork
Skywork series models are pre-trained on a 3.2TB sized high-quality multilingual dataset (predominantly Chinese and English).
Vitron
Vitron is a universal pixel-level vision LLM, designed for comprehensive understanding (perceiving and reasoning), generating, segmenting (grounding and tracking), editing (inpainting) of both static image and dynamic video content.
AgentStudio
AgentStudio is an open toolkit covering the entire lifespan of building virtual agents that can interact with everything in digital worlds.
PointCloudMamba
Point Cloud Mamba surpasses the SOTA point-based method, PointNeXt and achieves new SOTA performance on the ScanObjectNN, ModelNet40, and ShapeNetPart datasets.
Skywork
FOR THE BEST AND THE BRIGHTEST