
This Week's Shiny New AI Model
Alibaba Just Open-Sourced a Coding AI That Runs on a Single GPU
Alibaba's new Qwen3.6-35B-A3B activates just 3B of its 35B parameters, letting it run on a single GPU.
Alibaba's Qwen team dropped a new model this week, Qwen3.6-35B-A3B. Name's a mouthful, but the architecture is interesting. It's a Mixture-of-Experts model with 35 billion total parameters, but it only activates about 3 billion per inference. That's small enough to run on a single consumer GPU.
The model is designed for agentic coding, so that means it can write code, but also use tools, browse the web, and run terminal commands in a loop to complete complex tasks. You give it a goal, and it figures out the steps, it's not just autocomplete on steroids.
The Numbers
On SWE-bench Verified, it shows strong performance. That's competitive with Google's Gemma4-31B which is a much larger, dense model. It also beats its elder sibling Qwen3.5-35B-A3B by a decent margin. It's not the best out there, but it's solid for something you can run locally.
The model can understand images too, not just text. On tests measuring both vision and language skills, it matches or approaches Claude Sonnet 4.5. That's notable because Claude is a much larger and more expensive model. The context window (how much text or code you can feed it at once), is 256,000 tokens by default, roughly a 500-page book. With a few adjustments, you can push it past a million tokens.
Where to Run It
The model weights are on Hugging Face and ModelScope, fully open too. You can also try it for free on Qwen Studio if you just want to poke around before downloading 70GB.
For API users, Alibaba is offering qwen3.6-flash, which includes a preserve_thinking parameter that keeps the model's chain-of-thought in the conversation history. Very useful for debugging or for multi-step agentic workflows where you want to see why it made a particular decision.
Alibaba says more Qwen3.6 models are coming soon, including a flagship called Qwen3.6-Max. No word yet on whether those will also be open source. We'll see.
Tags
Join the Discussion
Enjoyed this? Ask questions, share your take (hot, lukewarm, or undecided), or follow the thread with people in real time. The community’s open, join us.
Latest in Dev Digest

Alibaba Just Open-Sourced a Coding AI That Runs on a Single GPU
Apr 17, 2026

Copilot Ads in Pull Requests? GitHub Backtracks After Backlash
Mar 30, 2026

Ubuntu 26.04 Finally Shows Asterisks When Typing Your Sudo Password
Mar 21, 2026

OpenClaw Tops GitHub Star Rankings, Surpasses React
Mar 2, 2026

React Leaves Meta, Gets a New Home
Feb 25, 2026
Right Now in Tech

PS5 Price Hike: $650 for Standard, $900 for Pro Starting April 2
Mar 28, 2026

Apple Discontinues Mac Pro, Ends Intel Era
Mar 27, 2026

OpenAI Is Pulling the Plug on Sora
Mar 26, 2026

Meta and YouTube Ordered to Pay $3M in Landmark Social Media Ruling
Mar 25, 2026

Your Galaxy S26 Can Finally AirDrop to an iPhone
Mar 23, 2026