LM Studio

Local & Open Source

LM Studio is a desktop application that lets you discover, download, and run open-source language models directly on your computer — no cloud required. It provides an intuitive chat interface and an OpenAI-compatible local API server, making it a drop-in replacement for cloud APIs in development workflows. With GPU acceleration support for Apple Silicon (Metal), NVIDIA (CUDA), and AMD, it delivers responsive performance even on consumer hardware. It's ideal for researchers working with sensitive data, developers testing against local models, and anyone who values AI privacy.

Key Features

Run LLMs entirely offline with no data leaving your machine

Search and download quantized models directly from Hugging Face

OpenAI-compatible local server for development integration

GPU acceleration: Metal (Apple Silicon), CUDA (NVIDIA), ROCm (AMD)

Built-in model evaluation and comparison tools

Usage Guide

Running Your First Local Model

1
Download LM Studio from lmstudio.ai for your platform.
2
Use the search bar to find a model (e.g., 'Llama 3.3 8B' or 'DeepSeek V4').
3
Select a quantized version that fits your available RAM — lower GB numbers for less RAM usage.
4
Go to the Chat tab, load the model, and start conversing entirely offline.
5
Enable the Local Server to use the model as a drop-in replacement for the OpenAI API in your code.