One API Access 500+ AI Models - CometAPI (original) (raw)

CClaude Fable 5The timeline for restored availability is subject to Anthropic’s notice. Due to access restrictions, this model may fail to respond. We recommend using Opus 4.8 as an alternative for now.OGPT Image 2GPT Image 2 is openai state-of-the-art image generation model for fast, high-quality image generation and editing. It supports flexible image sizes and high-fidelity image inputs.DDoubao-Seedance-2-0Per Second:$0.063Seedance 2.0 is ByteDance’s next-generation multimodal video foundation model focused on cinematic, multi-shot narrative video generation. Unlike single-shot text-to-video demos, Seedance 2.0 emphasizes reference-based control (images, short clips, audio), coherent character/style consistency across shots, and native audio/video synchronization — aiming to make AI video useful for professional creative and previsualization workflows.QHappy Horse 1.0Per Second:$0.112Happy Horse 1.0 — A high-quality audio-video generation model that supports text-to-video and image-to-video creation. It can generate synchronized visuals, audio, and lip movements, making it suitable for short films, advertising creatives, and product showcases.CClaude Opus 4.8Claude Opus 4.8 is a premium AI model designed for advanced reasoning, deep analysis, and high-quality content generation. It excels at handling complex instructions, long-context understanding, and sophisticated problem-solving across professional and technical domains.GGemini 3.5 FlashInput:$1.2/MOutput:$7.2/MGemini 3.5 Flash is a high-speed AI model designed for fast response and efficient coding performance. It delivers significantly improved generation speed while maintaining strong reasoning ability, making it suitable for real-time applications and developer workflows.GGemini 3.1 ProInput:$1.6/MOutput:$9.6/MGemini 3.1 Pro is the next generation in the Gemini series of models, a suite of highly-capable, natively multimodal, reasoning models. Gemini 3 Pro is now Google’s most advanced model for complex tasks, and can comprehend vast datasets, challenging problems from different information sources, including text, audio, images, video, and entire code repositoriesMKimi K2.7 CodeInput:$0.76/MOutput:$3.19998/MKimi K2.7 Code is Kimi's most intelligent coding model to date, reliably following instructions in long contexts and completing programming tasks with a higher success rate. It supports text, image, and video input, and only supports thought mode, dialogue, and agent tasks.CClaude Mythos 5Anthropic's most capable, widely released model, for the most demanding reasoning and long-horizon agentic workCClaude Opus 4.7Claude Opus 4.7 is a hybrid reasoning model designed specifically for frontier-level coding, AI agents, and complex multi-step professional work. Unlike lighter models (e.g., Sonnet or Haiku variants), Opus 4.7 prioritizes depth, consistency, and autonomy on the hardest tasks.MMiniMax-M3Input:$0.48/MOutput:$1.92/MMinimax-m3 is a multimodal AI model designed for strong reasoning, natural conversation, and creative content generation. It provides balanced performance across text and visual understanding tasks, making it suitable for general-purpose AI applications.Grok 4.3 is a general-purpose AI model designed for strong reasoning, real-time information processing, and conversational intelligence. It delivers improved accuracy and responsiveness, making it suitable for coding, analysis, and everyday productivity tasks.OGPT 5.5 ProGPT-5.5 Pro combines state-of-the-art intelligence, precision, and efficiency to tackle sophisticated challenges. From software development and data analysis to research and decision support, it delivers expert-level assistance with speed and consistency.Model 5.5 is a next-generation AI model designed for stronger reasoning, faster responses, and improved accuracy across a wide range of tasks. It excels at understanding complex instructions, generating high-quality content, and assisting with coding, analysis, and problem-solving.DDeepSeek V4 FlashInput:$0.12/MOutput:$0.24/MDeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and high-throughput workloads, while maintaining strong reasoning and coding performance.DDeepSeek V4 ProInput:$0.416/MOutput:$0.832/MDeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding, and long-horizon agent workflows, with strong performance across knowledge, math, and software engineering benchmarks.MMiniMax-M2.7Input:$0.24/MOutput:$0.96/MMiniMax-M2.7 offers the same top-tier intelligence as the standard version—including recursive self-evolution and expert-level office productivity—but is designed for applications requiring sub-second latency and high-speed token generation. Leveraging an enhanced inference backbone architecture, its output speed is 66% faster than the standard model (reaching 100 tps). It is the preferred choice for interactive programming assistants, real-time agent loop execution, and high-throughput enterprise pipelines with stringent completion time requirements.OGPT-5.4 nanoContext:400,000 GPT-5.4 Nano is an ultra-lightweight AI model built for maximum speed and efficiency. It is optimized for simple tasks, real-time interactions, and large-scale deployments where low latency and minimal resource consumption are essential.OGPT-5.4 miniContext:400,000 Input:$0.6/MOutput:$3.6/MGPT-5.4 Mini is a lightweight and efficient AI model optimized for speed and everyday productivity. It provides reliable conversational capabilities, content generation, and task assistance while maintaining low latency and resource usage.OGPT-5.4 proContext:1,050,000 GPT-5.4 Pro is a high-performance AI model designed for professional and business applications. It offers strong reasoning, reliable accuracy, and efficient execution across tasks such as content creation, coding, research, and data analysis.GNano Banana 2Input:$0.4/MOutput:$2.4/MCore Capabilities Overview: Resolution: Up to 4K (4096×4096), on par with Pro. Reference Image Consistency: Up to 14 reference images (10 objects + 4 characters), maintaining style/character consistency. Extreme Aspect Ratios: New 1:4, 4:1, 1:8, 8:1 ratios added, suitable for long images, posters, and banners. Text Rendering: Advanced text generation, suitable for infographics and marketing poster layouts. Search Enhancement: Integrated Google Search + Image Search. Grounding: Built-in thinking process; complex prompts are reasoned before generation.CClaude Sonnet 4.6Claude Sonnet 4.6 is our most capable Sonnet model yet. It’s a full upgrade of the model’s skills across coding, computer use, long-context reasoning, agent planning, knowledge work, and design. Sonnet 4.6 also features a 1M token context window in beta.Input:$1.12/MOutput:$3.528/MGLM-5.2 is a significant update from Zhipu in the areas of open-source large models and AI coding.QQwen3.7 PlusInput:$0.32/MOutput:$1.28/MQwen3.7 Plus is a high-performance large language model developed by Alibaba Cloud. It supports long-context understanding up to 128K tokens, function calling, and multilingual tasks. Designed for complex reasoning, coding, and instruction-following scenarios.GGemini omni fastPer Request:$0.4Gemini Omni Fast is a lightweight multimodal video generation model designed for fast and flexible content creation. It enables efficient video generation with support for multiple input types, making it suitable for interactive and iterative workflows.QQwen3.7-MaxInput:$1.36/MOutput:$4.08/MQwen3.7-Max's core strength lies in the breadth and depth of its agentic capabilities. In coding, it handles everything from front-end prototyping to complex multi-file engineering projects. For office and productivity work, it enables workflow automation through MCP integration and multi-agent collaboration. In long-horizon autonomous execution, it maintained coherent reasoning throughout a 35-hour, fully autonomous kernel optimization experiment involving over 1,000 tool calls — convincingly demonstrating its sustained, stable execution. Furthermore, it delivers consistently strong cross-framework generalization, performing reliably whether deployed in Claude Code, OpenClaw, Qwen Code, or other frameworks.OGPT Image 2 ALLPer Request:$0.04GPT Image 2 ALL is a comprehensive image generation model designed to handle a wide range of creative and professional visual tasks. It combines high-quality image creation, advanced prompt understanding, and versatile style support to deliver exceptional results across diverse use cases.OGPT 5.5 ALLInput:$2.4/MOutput:$14.4/MGPT-5.5 excels in code writing, online research, data analysis, and cross-tool operations. The model not only improves its autonomy in handling complex multi-step tasks but also significantly improves reasoning capabilities and execution efficiency while maintaining the same latency as its predecessor, marking an important step towards automated office automation in AI.XGrok 4.20Context:2,000,000Grok 4.20 release introduces a multi-agent architecture (multiple specialized agents coordinated in real time), expanded context modes, and focused improvements to instruction-following, hallucination reduction, and structured/tooled outputs.