| Gemini-1.5-Pro | Multi-modal model from Google's Gemini family that balances model performance and speed. Accepts text, image, and video input, limited to one video per message. | Google | OFFICIAL | Text Generation |
| Llama-3-70b-Groq | Llama 3 70b powered by the Groq LPU™ Inference Engine | Meta/Groq | OFFICIAL | Text Generation |
| GPT-4.5-Preview | Research preview of GPT-4.5, designed to be more conversational, empathetic & helpful. Supports 128k token context window. | OpenAI | OFFICIAL | Text Generation |
| Claude-3.5-Haiku | The latest generation of Anthropic's fastest model, with improved instruction following. | Anthropic | OFFICIAL | Text Generation |
| o1 | OpenAI's model designed to reason before responding. Supports 200k tokens of input context and can reason through images. | OpenAI | OFFICIAL SUBSCRIBER ACCESS | Reasoning |
| Mistral-Medium | Mistral AI's medium-sized model with 32k token context window. Stronger than Mixtral-8x7b on benchmarks. | Mistral AI | OFFICIAL | Text Generation |
| o1-pro | OpenAI's highly capable reasoning model for complex, compute-intensive tasks. Spends more time for accurate answers. | OpenAI | OFFICIAL SUBSCRIBER ACCESS | Reasoning |
| Qwen-QwQ-32b | QwQ is the reasoning model of the Qwen series, achieving competitive performance against models like DeepSeek-R1 and o1-mini. | Alibaba | OFFICIAL | Reasoning |
| QwQ-32B-T | Compact, open-source reasoning model with 32B parameters. Strong performance on math, coding, and problem-solving tasks. | Qwen | OFFICIAL | Reasoning |
| ChatGPT-4o-Latest | Dynamic model continuously updated to the current version of GPT-4o in ChatGPT. Supports 128k token context window. | OpenAI | OFFICIAL | Text Generation |
| o3-mini | OpenAI's recent reasoning model for science, math, and coding. Supports 200k tokens of input/output context. | OpenAI | OFFICIAL | Reasoning |
| Llama-4-Scout | Versatile LLM with multi-modal capabilities for tasks like multi-document summarization. Supports 131k tokens context. | Meta | OFFICIAL NEW | Text Generation |
| Llama-4-Maverick-T | State-of-the-art multimodal model. MoE powerhouse for multilingual image/text understanding across 12 languages. 500k token context. | Meta | OFFICIAL NEW | Text Generation |
| o3-mini-high | OpenAI's reasoning model with high reasoning effort. Excels at science, math, and coding tasks. 200k token context. | OpenAI | OFFICIAL SUBSCRIBER ACCESS | Reasoning |
| o1-mini | Smaller version of OpenAI's o1 model for complex reasoning tasks. Supports 128k tokens of context. | OpenAI | OFFICIAL | Reasoning |
| DeepSeek-V3 | Open-source LLM with state-of-the-art performance in coding, mathematics, and reasoning. 131k context window. | Together AI | OFFICIAL | Text Generation |
| Gemini-1.5-Flash | Google's model optimized for speed-sensitive tasks. Accepts text, image, and video input, limited to one video per message. | Google | OFFICIAL | Text Generation |
| Llama-3.3-70B-FW | Meta's Llama 3.3 70B, hosted by Fireworks AI. Delivers leading performance at a fraction of the inference cost. | Meta/Fireworks | OFFICIAL | Text Generation |
| Llama-3.3-70B | Similar performance to Llama 3.1 405B while being faster and smaller. Excels in synthetic data generation. | Meta | OFFICIAL | Text Generation |
| GPT-4o-mini | Smart, cost-effective model from OpenAI. As fast as GPT-3.5 Turbo but significantly smarter. | OpenAI | OFFICIAL | Text Generation |
| Llama-3.3-70B-FP16 | Pretrained and instruction-tuned LLM optimized for multilingual dialogue use cases. | Meta | OFFICIAL | Text Generation |
| Claude-3.7-Sonnet-Reasoning | Anthropic's most intelligent model with reasoning capabilities enabled by default. 200k token context window. | Anthropic | OFFICIAL | Reasoning |
| DeepSeek-R1 | Open-source reasoning LLM rivaling OpenAI's o1. Excels in math, code, and reasoning tasks. 164k token context. | Together AI | OFFICIAL | Reasoning |
| DeepSeek-R1-FW | State-of-the-art reasoning model for problem solving, math, and coding with chain-of-thought explanations. 164k token context. | Fireworks AI | OFFICIAL | Reasoning |
| Web-Search | Web-enabled assistant that searches the internet for responses. Good for up-to-date information and facts. | GPT 4o Mini | OFFICIAL | Web Search |