262+ curated AI tools and growing

API & Models

Browse AI tools in this category

⌘ K

Hot searches

All Tools

27 tools

GroqAPI & Models

Groq LPU inference with ultra-fast Llama and open models via API.

Fast★ 4.9·2M+

Ollama✓API & Models

CLI to run open LLMs locally for self-hosting and developer integration.

Local★ 4.1·2.3k

Open WebUIAI Chat

Self-hosted open chat UI for Ollama and OpenAI-compatible APIs.

Open source★ 4.3·50k

OpenRouterAPI & Models

Unified API gateway to call GPT, Claude, Llama, and hundreds of models.

API★ 4.4·50k

Together AIAPI & Models

Cloud inference for open models—Llama, Mixtral APIs and fine-tuning.

API★ 4.6·2M+

ReplicateAPI & Models

Cloud API platform to run image, video, and language models.

API★ 4.3·12.5k

Self-hostable open-source AI code completion server.

Open source★ 4.6·800k

Refact.aiAI Coding

Open AI coding agent with self-hosting and fine-tuning.

Agent★ 4.8·10M+

Hugging Face✓API & Models

Open model hub with Inference API and Spaces deployment.

Open source★ 4.4·2.3k

fal.aiAPI & Models

Generative media model API for fast image/video/audio inference.

API★ 4.8·800k

DeepInfraAPI & Models

Low-cost LLM and diffusion model inference API hosting.

API★ 4.5·2.3k

Fireworks AIAPI & Models

Fast open-model inference API with tool use and fine-tuning.

Inference★ 4.8·50k

AnyscaleAPI & Models

Ray-based model deployment and LLM inference platform.

Ray★ 4.6·2.3k

ModalAPI & Models

Serverless GPU cloud to run AI workloads by the second.

GPU★ 4.3·2M+

RunPodAPI & Models

GPU cloud and serverless inference for custom models.

GPU★ 4.1·10M+

Lepton AIAPI & Models

Lightweight AI app and model deployment, Python-first.

Deploy★ 4.8·12.5k

BasetenAPI & Models

Truss-based model deployment and production inference APIs.

MLOps★ 4.4·10M+

Cerebras InferenceAPI & Models

Ultra-fast LLM inference API on Cerebras wafer-scale chips.

Fast★ 4.3·12.5k

DeepgramAPI & Models

Speech-to-text and TTS APIs with low-latency streaming.

STT★ 4.6·12.5k

AssemblyAIAPI & Models

Speech AI APIs for transcription, summarization, and LeMUR LLM.

Transcription★ 4.9·2.3k

PineconeAPI & Models

Vector database and RAG retrieval API for LLM apps.

Vector★ 4.4·50k

llama.cppAPI & Models

Local LLM inference engine running GGUF models on CPU/GPU.

Local★ 4.2·12.5k

LM StudioAI Chat

Desktop app to run local LLMs with OpenAI-compatible API.

Local★ 4.8·2.3k

Amazon NovaAI Chat

AWS Nova multimodal foundation models for chat and content.

AWS★ 4.8·12.5k

Yi by 01.AIAI Chat

01.AI Yi models for bilingual chat and API access.

China★ 4.5·2.3k

ComfyUIAI Image

Node-based Stable Diffusion workflows for custom generation pipelines.

Nodes★ 4.8·800k

Resemble AIAI Audio

Voice cloning and emotional TTS API for games and media.

Cloning★ 4.8·800k