- Published on
Nebius AI Studio has introduced a range of new features and capabilities, enhancing its AI functionalities for various applications.
Expanded Model Portfolio
Nebius AI Studio now offers a wider selection of vision and language models, catering to diverse needs. These models include Qwen-2, LLaVA, Meta's Llama 3.3, Dolphin, Phi, and various Llama 3 series models. They support complex reasoning, multilingual scenarios, and specialized tasks like medical applications. Vision models enable image captioning, product recognition, and other visual content interactions.
Enhanced Retrieval Augmented Generation (RAG)
New embedding models like BGE-ICL, e5-mistral, and bge-multilingual-gemma2 have been added to improve RAG pipelines. This facilitates building knowledge bases, advanced semantic search engines, and contextual chatbots.
LoRA Hosting
A usage-based LoRA hosting service allows users to upload and run their pre-trained LoRA models without fixed costs or infrastructure management. Users only pay for the tokens used during inference.
Scalability and Performance
The platform boasts unlimited scalability, handling massive batch processing and ensuring consistent performance for both prototypes and production-level applications. Flexible deployment options allow users to optimize for cost or throughput.
Source(s):
Keep reading
Related posts
Nov 23, 2024
0CommentsContinue AI Powered Coding Assistant for VS Code and JetBrains
Discover how Continue, an open-source AI tool, enhances coding in VS Code and JetBrains IDEs with real-time suggestions, seamless editing, and more.
Dec 7, 2025
0CommentsSecure Your Data, Unlock AI: Deploy Open WebUI Locally with Remote Ollama GPU
This technical guide details the deployment of Open WebUI in a Docker container via WSL, configured to interface with a remote, GPU-accelerated Ollama instance on a local network. Follow these steps for a decoupled, high-performance LLM interface setup.
Jun 17, 2025
0CommentsIntroducing Codestral Embed: Mistral AI's New State-of-the-Art Code Embedding Model
Analysis of Mistral AI's Codestral Embed, a new state-of-the-art embedding model specialized for code, detailing its performance, flexibility, key use cases like RAG and semantic search, and availability.