Octopus V3

The smallest, most powerful on-device multimodal model for super AI agents --- fast, accurate, energy-efficient

Request access

Introducing Octopus V3

smallest, most powerful on-device multimodal model for super AI agents

Compact size

Less than 1B parameters

Multimodal

Processes both text and images for function calling

High Performance

On par with a combination of GPT-4V and GPT-4

Multilingual

Fluent in English and Mandarin

Request access

Cool things Octopus V3 can do:

Octopus V3 can process both visual and textual user inputs, executing tasks swiftly and precisely. Its compact design and integration of visual data enable highly accurate and context-aware function calls. Additionally, it is energy-efficient and ensures robust data privacy.

Octopus V3 Demo Video

Instacart

[data-wf-bgvideo-fallback-img] { display: none; } @media (prefers-reduced-motion: reduce) { [data-wf-bgvideo-fallback-img] { position: absolute; z-index: -100; display: inline-block; height: 100%; width: 100%; object-fit: cover; } }

Google Search

Send Email

Amazon

Compact Multimodal AI for Edge Devices

Discover EdgeAI, a compact AI model for edge devices, handling text, visuals, and audio in English and Chinese. Efficient on low-power devices. Access demos and tools for research.

Read Technical Report

Explore our collection of 200+ Premium Webflow Templates