2 articles · Articles about AI infrastructure
LocalAI provides a unified OpenAI-compatible API that routes requests to Ollama, llama.cpp, and other backends, serving as a drop-in replacement for the OpenAI API.
Deploy vLLM for 10–24x higher throughput than naive LLM serving, powered by PagedAttention, continuous batching, and an OpenAI-compatible API.