Nvidia Released a new artificial intelligence (ai) model last week that can be used to train robots on simulation. Dubbed cosmos-transfer 1, The New World Generation Large Language Model (LLM) is aimed at AI-Powered Robotics Hardware, also Known as Physical Ai. The company has released the model in open source with a permissive license, and interested individuals can download it from popular online repositories. The Santa Clara-based tech giant highlighted that the main advantage of the latest ai model is that users will have granular control over the generated simulations.
Nvidia releases ai model to train robots
Simulation-based Robotics Training Has Gained Wind in Recent Times Due to the Advancement in Generative Ai Technology. This specific branch of robotics deals with hardware that uses an ai for its brain. Essentially, the training methods the brain of the machine of the machine of the Various real-world Scenarios so that it can handle a wider range of tasks. This is a big improvement compared to current robots in facties that are designed to complete a single task.
Nvidia’s cosmos-transfer1 is part of the company’s cosmos transfer models (WFMS) which engage structured video input such as segmentation maps, Depth Maps, Depth Maps, Lidar SCANS and LIDAR SCANS and LIDAR SCANS Photoreal video outputs. These outputs can then be used as simulation ground to train physical ai.
In a paper Published in the Arxiv Journal, the company stated that this model offers green customisation than its predacesors. It enables varying the weight of different conditional inputs based on spatial location. Essentially, this will allow developers to generate highly controlable world generation. Another Advantage of the Model Includes Real-Time World Generation that is helpful in faster and more divese training sessions.
Coming to model specification, the cosmos-transfer1 is a Diffusion-Based Model with Seven Billion Parameters. It is designed for video denoising in the latency space, and can be modulated by a Control Branch. The model accepts text and video as input, and using bot, it can generate a photorealistic output video. The model supports four types of control input videos including canny edge, BLURRED RGB, Segmentation Mask, and Depth Map.
The ai model has been tested on Nvidia’s blackwell and hopper series chipsets, and the infererance was run on the linux operating system. The tech giant has made the ai model available with the nvidia open model license agreement which allows bot academic and commercial usage.
Nvidia’s cosmos-transfer1 ai model can be downloaded from the company’s github Listing and hugging face ListingAnother ai model with 14 billion parameters is expected to be released song.