Nebius AI Studio New Features and Capabilities

Nebius AI Studio has introduced several new features: a larger catalog of vision and language models, new embedding models for retrieval, usage-based LoRA hosting, and flexible deployment options for scaling workloads.

Expanded Model Portfolio

Nebius AI Studio now offers a wider selection of vision and language models, including Qwen-2, LLaVA, Meta's Llama 3.3 and other Llama 3 series models, Dolphin, and Phi. The language models support complex reasoning, multilingual scenarios, and specialized tasks such as medical applications. The vision models handle image captioning, product recognition, and other interactions with visual content.

Enhanced Retrieval Augmented Generation (RAG)

New embedding models, including BGE-ICL, e5-mistral, and bge-multilingual-gemma2, have been added to improve RAG pipelines. They can be used to build knowledge bases, semantic search engines, and context-aware chatbots.
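The retrieval step of a RAG pipeline can be illustrated with a minimal sketch. It assumes document and query embeddings have already been produced by some embedding model; the three-dimensional vectors, the document names, and the helper functions `cosine_similarity` and `top_k` are hypothetical stand-ins for illustration, not part of the Nebius AI Studio API.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)

def top_k(query_vec, doc_vecs, k=2):
    """Return the ids of the k documents most similar to the query."""
    scored = [(cosine_similarity(query_vec, vec), doc_id)
              for doc_id, vec in doc_vecs.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [doc_id for _, doc_id in scored[:k]]

# Toy vectors standing in for real embedding output (assumption:
# real embeddings have hundreds or thousands of dimensions).
docs = {
    "pricing": [0.9, 0.1, 0.0],
    "lora": [0.1, 0.9, 0.1],
    "vision": [0.0, 0.2, 0.9],
}
query = [0.8, 0.3, 0.0]

print(top_k(query, docs, k=2))
```

In a full pipeline, the retrieved documents would then be placed into the prompt of a language model so that its answer is grounded in them.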

LoRA Hosting

A usage-based LoRA hosting service lets users upload and run their pre-trained LoRA models with no fixed costs and no infrastructure to manage. Users pay only for the tokens consumed during inference.
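Token-based billing reduces to simple arithmetic, sketched below. The function name `inference_cost` and the per-million-token rate are hypothetical; the source states no actual prices, so the figure used here is a placeholder only.

```python
def inference_cost(tokens_used, price_per_million_tokens):
    """Cost of a workload billed purely per token, with no fixed fee."""
    return tokens_used / 1_000_000 * price_per_million_tokens

# Hypothetical rate for illustration only; not a published price.
HYPOTHETICAL_PRICE = 0.50  # currency units per one million tokens

# An idle month costs nothing, because there is no fixed charge.
idle_month = inference_cost(0, HYPOTHETICAL_PRICE)

# A month with 4 million tokens of inference traffic.
busy_month = inference_cost(4_000_000, HYPOTHETICAL_PRICE)

print(idle_month, busy_month)
```

Under this model the cost scales linearly with traffic, which is the practical difference from renting dedicated hardware at a fixed rate.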

Scalability and Performance

The platform is designed to scale with demand, handling massive batch-processing jobs and keeping performance consistent from prototype to production. Flexible deployment options let users optimize for either cost or throughput.
