Skip to content
LM Studio

LM Studio

Local & Open Source

LM Studio is a desktop application that lets you discover, download, and run open-source language models directly on your computer — no cloud required. It provides an intuitive chat interface and an OpenAI-compatible local API server, making it a drop-in replacement for cloud APIs in development workflows. With GPU acceleration support for Apple Silicon (Metal), NVIDIA (CUDA), and AMD, it delivers responsive performance even on consumer hardware. It's ideal for researchers working with sensitive data, developers testing against local models, and anyone who values AI privacy.

Key Features

Run LLMs entirely offline with no data leaving your machine
Search and download quantized models directly from Hugging Face
OpenAI-compatible local server for development integration
GPU acceleration: Metal (Apple Silicon), CUDA (NVIDIA), ROCm (AMD)
Built-in model evaluation and comparison tools

Usage Guide

Running Your First Local Model

  1. 1
    Download LM Studio from lmstudio.ai for your platform.
  2. 2
    Use the search bar to find a model (e.g., 'Llama 3.3 8B' or 'DeepSeek V4').
  3. 3
    Select a quantized version that fits your available RAM — lower GB numbers for less RAM usage.
  4. 4
    Go to the Chat tab, load the model, and start conversing entirely offline.
  5. 5
    Enable the Local Server to use the model as a drop-in replacement for the OpenAI API in your code.