Render clean, production-ready PDFs from HTML, URLs, or image inputs with fine-grained control over size, margins, and orientation. Optimized for fast, reliable exports and secure storage.
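For illustration, a minimal conversion request might look like the sketch below. The endpoint URL, field names, and auth header are hypothetical placeholders (the actual REST surface is not specified on this page); they only illustrate the page-size, margin, and orientation controls described above.

```python
import requests

# Hypothetical endpoint and field names -- replace with the service's real API.
API_URL = "https://api.example.com/v1/pdf/render"
API_KEY = "YOUR_API_KEY"

payload = {
    "html": "<h1>Quarterly Report</h1><p>Generated for export.</p>",  # or e.g. "url": "https://example.com"
    "page_size": "A4",           # paper size
    "orientation": "landscape",  # portrait | landscape
    "margin": {"top": "20mm", "bottom": "20mm", "left": "15mm", "right": "15mm"},
}

resp = requests.post(API_URL, json=payload, headers={"Authorization": f"Bearer {API_KEY}"})
resp.raise_for_status()

# Write the returned PDF bytes to disk.
with open("report.pdf", "wb") as f:
    f.write(resp.content)
```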
Models
Lightweight, fast, and cost-efficient for everyday tasks. GPT-4.1 Mini delivers reliable chat, code, and analysis with low latency and budget-friendly pricing, making it ideal for high-volume or background workloads.
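As a rough usage sketch, assuming access through the standard OpenAI Python SDK and the "gpt-4.1-mini" model identifier:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Low-latency chat completion suited to high-volume or background jobs.
response = client.chat.completions.create(
    model="gpt-4.1-mini",
    messages=[{"role": "user", "content": "Summarize this support ticket in one sentence: ..."}],
)
print(response.choices[0].message.content)
```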
Image generation is powered by DALL-E 2, a generative model based on a diffusion architecture. It leverages CLIP (Contrastive Language-Image Pre-training) to map text inputs to a visual semantic space. The system uses a two-stage process: a 'prior' that generates a CLIP image embedding from the text caption, and a diffusion 'decoder' that produces the final image from that embedding; this combined prior-and-decoder design is known as unCLIP. This architecture enables the model to understand complex relationships between objects and generate coherent, high-resolution visuals.
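A minimal generation call through the OpenAI Python SDK might look like the following; the prompt and size are illustrative only.

```python
from openai import OpenAI

client = OpenAI()

# Text-to-image generation: the caption is mapped to a CLIP image embedding by the
# prior, then rendered into pixels by the diffusion decoder.
result = client.images.generate(
    model="dall-e-2",
    prompt="A watercolor fox reading a newspaper in a sunlit cafe",
    size="512x512",
    n=1,
)
print(result.data[0].url)  # temporary URL of the generated image
```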
Whisper-1 is OpenAI’s premier speech-to-text model, accessible via their API and optimized for high-accuracy, multilingual audio processing. Built on an encoder-decoder Transformer architecture, it was trained on 680,000 hours of diverse, weakly supervised web data, making it exceptionally robust against background noise, various accents, and technical jargon. Whisper-1 functions as a multitask system capable of automatic speech recognition (ASR), language identification, and seamless speech translation from dozens of languages into English. It is widely recognized for delivering near-human-level transcription and is the industry standard for creating accessible, searchable, and translated audio content.
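A typical transcription (and optional translation-to-English) call via the OpenAI Python SDK, assuming local audio files:

```python
from openai import OpenAI

client = OpenAI()

# Transcribe audio in its original language.
with open("meeting.mp3", "rb") as audio:
    transcript = client.audio.transcriptions.create(model="whisper-1", file=audio)
print(transcript.text)

# Or translate non-English speech directly into English text.
with open("interview_fr.mp3", "rb") as audio:
    translation = client.audio.translations.create(model="whisper-1", file=audio)
print(translation.text)
```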
Our application leverages the enhanced capabilities of DALL-E 3 to deliver superior visual fidelity. Key improvements over previous generations include the ability to render legible text and typography directly within images, precise adherence to complex instructions (such as specific object placement), and a conversational interface that allows users to iteratively refine their visuals using natural dialogue rather than rigid prompt syntax.
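The sketch below assumes the same OpenAI images endpoint as above, adding the DALL-E 3-specific quality and style parameters; the values shown are illustrative.

```python
from openai import OpenAI

client = OpenAI()

result = client.images.generate(
    model="dall-e-3",
    prompt='A storefront sign that reads "Open 24 Hours" in neon, on a rainy street at night',
    size="1024x1024",
    quality="hd",   # "standard" or "hd"
    style="vivid",  # "vivid" or "natural"
)
print(result.data[0].url)
print(result.data[0].revised_prompt)  # DALL-E 3 returns the rewritten prompt it actually rendered
```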
Gemini 2.5 Flash is a high-performance, lightweight multimodal model developed by Google, engineered for speed and cost-efficiency without compromising on intelligence. It features a massive 1 million token context window, allowing it to process and reason across vast datasets, long documents, and extensive codebases. Distinguished as a 'thinking' model, it utilizes adaptive reasoning to modulate its processing power based on task complexity, delivering superior prompt adherence and high-fidelity outputs. With native support for text, image, audio, and video inputs, Gemini 2.5 Flash is optimized for low-latency, real-time applications and complex agentic workflows.
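A minimal text call through the google-genai Python SDK is sketched below; the model identifier and SDK surface are assumptions, so check the current Google docs before relying on them.

```python
from google import genai

client = genai.Client()  # picks up GEMINI_API_KEY / GOOGLE_API_KEY from the environment

# Single low-latency request; `contents` can also mix text, image, audio, and video parts.
response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="Extract the action items from this meeting transcript: ...",
)
print(response.text)
```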
DeepL Translator is a world-leading neural machine translation (NMT) service known for its industry-topping accuracy and natural-sounding output. Powered by advanced deep learning architectures—specifically highly tuned convolutional neural networks (CNNs) and next-generation large language models (LLMs)—it excels at capturing subtle nuances, idiomatic expressions, and complex sentence structures that traditional translation engines often miss. DeepL is specifically engineered for professional use, offering advanced features like Glossaries for brand consistency, Formal/Informal tone control, and the ability to translate entire documents while preserving original formatting. With an emphasis on data security and privacy, it is the premier choice for enterprise-grade localization and high-fidelity multilingual communication.
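A short sketch using the official deepl Python package, showing tone control and whole-document translation; the auth key and file names are placeholders.

```python
import deepl

translator = deepl.Translator("YOUR_DEEPL_AUTH_KEY")

# Text translation with an informal tone (formality support varies by target language).
result = translator.translate_text(
    "Please review the attached contract before Friday.",
    target_lang="DE",
    formality="less",
)
print(result.text, "| detected source:", result.detected_source_lang)

# Whole-document translation that preserves the original formatting.
translator.translate_document_from_filepath(
    "contract.docx", "contract_de.docx", target_lang="DE"
)
```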
Leonardo Alchemy is a high-performance generative pipeline within the Leonardo.ai ecosystem, designed to deliver professional-grade image fidelity and sophisticated creative control. Built upon a custom-tuned Stable Diffusion XL (SDXL) framework, Alchemy utilizes advanced image-processing techniques such as Contrast Boost and Resonance to enhance dynamic range and prompt adherence. It is specifically engineered for 'production-ready' assets, offering specialized presets for photorealism, cinematic styles, and concept art. By integrating features like High-Resolution Upscaling and Image Guidance, Leonardo Alchemy allows for granular manipulation of lighting, texture, and composition, making it the premier choice for game developers, designers, and digital artists requiring consistent, high-fidelity visual outputs.
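A rough sketch of an Alchemy generation request against the Leonardo.ai REST API is shown below; the endpoint path and field names (notably alchemy and modelId) are assumptions drawn from the public API docs and should be verified before use.

```python
import requests

API_KEY = "YOUR_LEONARDO_API_KEY"
BASE_URL = "https://cloud.leonardo.ai/api/rest/v1"  # assumed base URL

payload = {
    "prompt": "Cinematic concept art of a desert outpost at dusk, volumetric light",
    "modelId": "YOUR_SDXL_MODEL_ID",  # placeholder for an SDXL-based Leonardo model id
    "alchemy": True,                  # enable the Alchemy pipeline (assumed field name)
    "width": 1024,
    "height": 576,
    "num_images": 1,
}

resp = requests.post(
    f"{BASE_URL}/generations",
    json=payload,
    headers={"Authorization": f"Bearer {API_KEY}"},
)
resp.raise_for_status()
print(resp.json())  # contains a generation id to poll for the finished images
```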
GPT-5.2 is our highest-fidelity coding and agentic model, tuned for complex software tasks across industries. It offers fast, precise code generation and refactoring, strong tool-use planning, and robust reasoning for multi-step automations, while remaining compatible with our unified API and subject to rate limits that keep costs predictable.
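As a sketch only, assuming the model is exposed through an OpenAI-compatible Responses endpoint under the identifier "gpt-5.2" (both the identifier and the endpoint are assumptions taken from this page):

```python
from openai import OpenAI

client = OpenAI()

# Ask for a targeted refactor; agentic tool-use planning would add a `tools` list here.
with open("slow_path.py") as f:
    source = f.read()

response = client.responses.create(
    model="gpt-5.2",  # identifier assumed from this page, not verified
    input="Refactor this function to remove the nested loops and add type hints:\n\n" + source,
)
print(response.output_text)
```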
Gemini 3 Flash is built for speed and high-frequency efficiency. It is the "workhorse" of the Gemini family, designed to provide frontier-level intelligence with the lowest possible latency. Best For: Real-time applications, high-volume automated tasks, and "agentic" workflows where the AI needs to move through many small steps quickly. Key Strength: It delivers "Pro-grade" reasoning (scoring impressively high on PhD-level benchmarks) but is optimized to use fewer tokens and respond nearly instantaneously, making it the most cost-effective choice for scaling.
Gemini 3 Pro is the flagship model for complex reasoning and multimodal depth. It is designed for high-precision tasks where the quality of the "thought process" is more critical than the speed of the delivery. Best For: Deep research synthesis, complex coding architecture, advanced scientific problem-solving, and analyzing massive datasets (up to 1 million tokens). Key Strength: It excels at "long-horizon" planning and spatial reasoning. If you need a model to look at a 45-minute video or a 1,000-page document and find a needle in a haystack with perfect accuracy, Pro is the superior choice.
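A long-context sketch using the google-genai SDK is shown below; the "gemini-3-pro" identifier is assumed from this page, and the current model name and context limits should be verified in practice.

```python
from google import genai

client = genai.Client()

# Load a very long document; the 1 million token window covers hundreds of pages of text.
with open("annual_report_full.txt", encoding="utf-8") as f:
    document = f.read()

response = client.models.generate_content(
    model="gemini-3-pro",  # identifier assumed from this page
    contents=[
        document,
        "List every clause that mentions early-termination penalties, with section references.",
    ],
)
print(response.text)
```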
Gemini 1.5 Flash is a high-speed, lightweight AI model designed for efficiency and near-instant responses while still handling massive amounts of information (like long documents or many images) at once.
GPT-5 Mini is a faster, lower-cost variant of GPT-5 tuned for well-defined tasks. It delivers reliable chat/completion outputs with significantly lower latency and price, making it ideal for routing, high-volume automations, and other predictable flows while keeping quality close to GPT-5.
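A minimal routing sketch, assuming OpenAI-style chat completions and the identifiers "gpt-5" and "gpt-5-mini" as used on this page:

```python
from openai import OpenAI

client = OpenAI()

def answer(prompt: str, complex_task: bool = False) -> str:
    """Route predictable, high-volume prompts to GPT-5 Mini and harder ones to GPT-5."""
    model = "gpt-5" if complex_task else "gpt-5-mini"  # identifiers assumed from this page
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content

print(answer("Classify this support ticket as billing, technical, or other: ..."))
```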