API & Models
Browse AI tools in this category
All Tools
27 tools

Groq LPU inference with ultra-fast Llama and open models via API.

CLI to run open LLMs locally for self-hosting and developer integration.

Self-hosted open chat UI for Ollama and OpenAI-compatible APIs.

Unified API gateway to call GPT, Claude, Llama, and hundreds of models.

Cloud inference for open models—Llama, Mixtral APIs and fine-tuning.

Cloud API platform to run image, video, and language models.

Self-hostable open-source AI code completion server.

Open AI coding agent with self-hosting and fine-tuning.

Open model hub with Inference API and Spaces deployment.

Generative media model API for fast image/video/audio inference.

Low-cost LLM and diffusion model inference API hosting.

Fast open-model inference API with tool use and fine-tuning.

Ray-based model deployment and LLM inference platform.

Serverless GPU cloud to run AI workloads by the second.

GPU cloud and serverless inference for custom models.

Lightweight AI app and model deployment, Python-first.

Truss-based model deployment and production inference APIs.

Ultra-fast LLM inference API on Cerebras wafer-scale chips.

Speech-to-text and TTS APIs with low-latency streaming.

Speech AI APIs for transcription, summarization, and LeMUR LLM.

Vector database and RAG retrieval API for LLM apps.

Local LLM inference engine running GGUF models on CPU/GPU.

Desktop app to run local LLMs with OpenAI-compatible API.

AWS Nova multimodal foundation models for chat and content.

01.AI Yi models for bilingual chat and API access.

Node-based Stable Diffusion workflows for custom generation pipelines.

Voice cloning and emotional TTS API for games and media.
