Does Locally Uncensored support Qwen 3.6?

Yes. Qwen 3.6 (35B MoE) has day-0 integration — vision, agentic coding, thinking preservation, 256K context. One-click download via the Model Manager.

Does it support GPT-OSS?

Yes. GPT-OSS-120B and GPT-OSS-20B run via Ollama, which is one of the twelve auto-detected local backends.

Does it support GLM-4.7?

Yes — GLM-4.7 Flash is supported through Ollama and other compatible backends. Eleven variants are available for one-click download.

Does it support DeepSeek R1 and Llama 4?

Yes. DeepSeek R1, DeepSeek V3, Llama 4, Llama 3.3, Gemma 4, Mistral Small 3, Phi 4 and more are all supported through any of the twelve auto-detected backends.

How is this different from Open WebUI, LM Studio, Jan or Msty?

Those tools handle text chat. Locally Uncensored adds a coding agent with 14 MCP tools, ComfyUI image generation (FLUX, Juggernaut, Z-Image, SDXL), video generation (Wan 2.1, HunyuanVideo, LTX 2.3, FramePack), mobile remote access via LAN or Cloudflare Tunnel, A/B model compare, local benchmarking, granular permissions, file upload with vision, and thinking mode. All in one app.

Can I run it on macOS?

Not yet. Windows 10/11 and Linux for now. macOS is on the roadmap but not promised. The source is AGPL-3.0 — build for your platform if needed.

Is there a mobile app?

Yes. The desktop app hosts a full mobile web app over LAN or Cloudflare Tunnel. Chat, Codex coding, 14 agent tools, plugins, personas — all from your phone.

Locally Uncensored — Desktop AI for Chat, Code, Image & Video

Not a chat app. The whole studio.

Chat with Qwen 3.6, GPT-OSS, GLM-4.7, Llama 4 and 70+ more.

Twenty-plus presets auto-detected: Ollama, LM Studio, vLLM, KoboldCpp, llama.cpp, OpenAI, Anthropic, Groq, OpenRouter, and eleven more. Day-0 Qwen 3.6, GPT-OSS, GLM-4.7 Flash, DeepSeek R1, Gemma 4, Mistral Small 3, Phi 4. Vision. Memory. RAG. Personas. Thinking-mode. A/B compare.

Qwen 3.6GPT-OSSGLM-4.7DeepSeek R1

Code with three agents built in.

Codex with live token streaming and apply-patch. Claude Code CLI integrated. An Agent Mode with fourteen tools, parallel execution, MCP, and sub-agent delegation. No separate IDE, no context-switching.

CodexClaude Code14 toolsMCP

Create with FLUX 2, Juggernaut, Wan 2.1, LTX 2.3.

ComfyUI runs in the background. Image: FLUX 2 Klein, FLUX.1, Juggernaut XL, Z-Image Turbo, ERNIE-Image, SDXL, SD 3.5. Video: Wan 2.1, HunyuanVideo 1.5, LTX 2.3, FramePack F1, AnimateDiff. Image-to-image and image-to-video. Seventy-five-plus one-click downloads, hardware-aware.

FLUX 2Juggernaut XLLTX 2.3Wan 2.1

Remote access from your phone.

LAN or Cloudflare Tunnel. Six-digit passcode. QR code setup. Full mobile web app: chat, Codex, tools, plugins. Your desktop does the compute — your phone drives it from anywhere.

LANtunnelQRmobile

Made for people who want their AI on their machine.

Most local-AI tools stop at text chat. This one gives you the whole studio: text, code, images, video, and your phone can drive it from anywhere. No data leaves your box unless you tell it to. No subscription. No telemetry.

Open source under AGPL-3.0. Auto-detects twelve backends and downloads models in one click. Signed auto-update channel, covered by 2,200+ unit tests.

Running in under five minutes.

No Docker. No terminal. No config files. Run the installer, let the wizard scan your system, start chatting.

01 · Install

Download & install.

One installer. Windows · Linux. Auto-updates over a signed channel. AGPL-3.0.

02 · Detect

Wizard finds everything.

First launch scans twelve local backends — Ollama, LM Studio, vLLM, KoboldCpp, Jan, llama.cpp, and more. One-click install links if nothing is running.

03 · Run

Chat. Code. Create.

Pick a model, start chatting. Switch to Codex for coding, open Create for images or video. Add cloud providers in Settings any time.

Works with the best local AI models.

Auto-detects models from any running backend. Day-0 support for Qwen 3.6, GPT-OSS, GLM-4.7 Flash, DeepSeek R1, Llama 4, Gemma 4, Mistral Small 3, and Phi 4. Seventy-five-plus one-click downloads, hardware-aware recommendations, VRAM-tier filtering.

CHAT · VISION

Gemma 4

Google flagship. Native tools, vision, Apache 2.0. E4B runs on 4 GB, 27B on 16 GB.

E4B · 27B

CHAT · REASONING

Qwen 3.6 · GPT-OSS · GLM-4.7

Strongest reasoning and coding. Qwen 3.6 (35B MoE, vision, 256K context, day-0). GPT-OSS-120B / 20B via Ollama. GLM-4.7 Flash. DeepSeek R1 and Llama 4 ready.

ABLITERATED VARIANTS · 8-22 GB VRAM

IMAGE

Juggernaut XL

Popular SDXL finetune. Strong photorealism, community favourite. Text-to-Image and Image-to-Image. 8-10 GB VRAM.

SDXL · FINETUNE

IMAGE · UNCENSORED

Z-Image Turbo

Explicitly uncensored. Eight to fifteen seconds per image. No safety filters. T2I and I2I.

10-16 GB VRAM

VIDEO · TEXT-TO-VIDEO

LTX 2.3

Lightricks LTX-Video. Fast text-to-video on modest hardware. Long clips, sharp motion, one-click setup.

T2V · 10-14 GB

VIDEO · IMAGE-TO-VIDEO

FramePack F1

Image-to-video on just 6 GB VRAM. Upload an image, get video. Next-frame prediction.

I2V · 6 GB

Common questions.

What is Locally Uncensored?

A free, open-source desktop app for running AI locally. Combines chat (20+ provider presets), a coding agent (Codex) with fourteen tools, image generation via ComfyUI (FLUX 2, Juggernaut XL, Z-Image, SDXL), and video generation (Wan 2.1, HunyuanVideo, LTX 2.3, FramePack F1) in one interface. AGPL-3.0 licensed.

Does it support Qwen 3.6, GPT-OSS and GLM-4.7?

Yes. Qwen 3.6 has day-0 integration (35B MoE, vision, agentic coding, 256K context). GPT-OSS-120B and GPT-OSS-20B run via Ollama. GLM-4.7 Flash supported through Ollama. Also Day-0 ready: DeepSeek R1, Llama 4, Gemma 4, Mistral Small 3, Phi 4.

Can I use it as a ChatGPT or Claude alternative?

Yes. Locally Uncensored works as a ChatGPT and Claude alternative that runs on your own hardware. Use Qwen 3.6, GPT-OSS, GLM-4.7, DeepSeek R1, Llama 4 or Gemma 4 instead — or add cloud providers (OpenAI, Anthropic, OpenRouter, Groq) alongside the local stack.

Is it really free and offline?

Yes. After setup and model download, no internet is needed for the local providers. No accounts, no telemetry, no usage limits. Cloud providers are optional — the core runs one-hundred percent on your hardware.

How is this different from Open WebUI or LM Studio?

Those tools handle text chat. Locally Uncensored adds a coding agent with fourteen MCP tools, image generation, video creation, A/B model comparison, local benchmarking, granular permissions, file upload with vision, and thinking mode — all in one app.

What hardware do I need?

Text chat: 8 GB RAM. Image generation: NVIDIA GPU with 8+ GB VRAM. Video generation: 10-12 GB VRAM. The app auto-detects hardware and recommends models. Windows 10/11 and Linux supported.

What does “uncensored” mean?

Abliterated models with artificial restrictions removed. The AI responds honestly without refusing or adding disclaimers. Combined with local execution, your conversations stay private.

Does remote access leak data?

Only if you explicitly dispatch a chat over LAN or Cloudflare Tunnel. Remote is opt-in, gated behind a six-digit passcode, and you see exactly when a device is connected. No background uploads. No telemetry.

Can I use this on macOS?

Not yet. Windows and Linux for now. macOS support is on the roadmap but not promised. The source is AGPL-3.0 if you want to build for your platform.

Locally Uncensored — 2.4.5

Generate Anything.Locally. Uncensored.

Not a chat app. The whole studio.

Chat with Qwen 3.6, GPT-OSS, GLM-4.7, Llama 4 and 70+ more.

Code with three agents built in.

Create with FLUX 2, Juggernaut, Wan 2.1, LTX 2.3.

Remote access from your phone.

Made for people who want their AI on their machine.

Running in under five minutes.

Download & install.

Wizard finds everything.

Chat. Code. Create.

Works with the best local AI models.

Gemma 4

Qwen 3.6 · GPT-OSS · GLM-4.7

Juggernaut XL

Z-Image Turbo

LTX 2.3

FramePack F1

Guides, comparisons, release notes.

How to Run Qwen 3.6 Locally

Abliterated Models Guide

v2.4.0 — Settings Polish + Linux Drag Fix

Google Gemma 4 — Run It Locally

Image-to-Image with Local AI

Best Local AI Apps in 2026

How to Run Uncensored AI Locally

LU vs Open WebUI

LU vs LM Studio

ComfyUI for Beginners

Generating AI Videos Locally