Small

bigscience-small-testing

bigscience-small-testing specs, VRAM requirements, and which GPUs can run it.

bitnet-b1.58-2B-4T-bf16

bitnet-b1.58-2B-4T-bf16 specs, VRAM requirements, and which GPUs can run it.

bloom-1b1

bloom-1b1 specs, VRAM requirements, and which GPUs can run it.

bloom-1b7

bloom-1b7 specs, VRAM requirements, and which GPUs can run it.

bloom-560m

bloom-560m specs, VRAM requirements, and which GPUs can run it.

bloomz-1b7

bloomz-1b7 specs, VRAM requirements, and which GPUs can run it.

bloomz-560m

bloomz-560m specs, VRAM requirements, and which GPUs can run it.

Bolmo-1B

Bolmo-1B specs, VRAM requirements, and which GPUs can run it.

codegemma-2b

codegemma-2b specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-Qwen3-8B-MLX-4bit

DeepSeek-R1-0528-Qwen3-8B-MLX-4bit specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-Qwen3-8B-MLX-8bit

DeepSeek-R1-0528-Qwen3-8B-MLX-8bit specs, VRAM requirements, and which GPUs can run it.

DialoGPT-small

DialoGPT-small specs, VRAM requirements, and which GPUs can run it.

distilgpt2

distilgpt2 specs, VRAM requirements, and which GPUs can run it.

ELM

ELM specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-0.5B-Base

Falcon-H1-0.5B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-0.5B-Instruct

Falcon-H1-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-1.5B-Base

Falcon-H1-1.5B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-1.5B-Instruct

Falcon-H1-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-Tiny-90M-Instruct

Falcon-H1-Tiny-90M-Instruct specs, VRAM requirements, and which GPUs can run it.

falcon-mamba-tiny-dev

falcon-mamba-tiny-dev specs, VRAM requirements, and which GPUs can run it.

Falcon3-1B-Instruct

Falcon3-1B-Instruct specs, VRAM requirements, and which GPUs can run it.

gemma-1.1-2b-it

gemma-1.1-2b-it specs, VRAM requirements, and which GPUs can run it.

gpt-neo-1.3B

gpt-neo-1.3B specs, VRAM requirements, and which GPUs can run it.

gpt-neo-125m

gpt-neo-125m specs, VRAM requirements, and which GPUs can run it.

gpt-neo-2.7B

gpt-neo-2.7B specs, VRAM requirements, and which GPUs can run it.

gpt-oss-120b-Eagle3-long-context

gpt-oss-120b-Eagle3-long-context specs, VRAM requirements, and which GPUs can run it.

gpt2

gpt2 specs, VRAM requirements, and which GPUs can run it.

gpt2-large

gpt2-large specs, VRAM requirements, and which GPUs can run it.

gpt2-medium

gpt2-medium specs, VRAM requirements, and which GPUs can run it.

gpt2-mini

gpt2-mini specs, VRAM requirements, and which GPUs can run it.

h2ovl-mississippi-2b

h2ovl-mississippi-2b specs, VRAM requirements, and which GPUs can run it.

h2ovl-mississippi-800m

h2ovl-mississippi-800m specs, VRAM requirements, and which GPUs can run it.

internlm2-chat-1_8b

internlm2-chat-1_8b specs, VRAM requirements, and which GPUs can run it.

japanese-gpt-neox-small

japanese-gpt-neox-small specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct

LFM2.5-1.2B-Instruct specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct-MLX-4bit

LFM2.5-1.2B-Instruct-MLX-4bit specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct-MLX-6bit

LFM2.5-1.2B-Instruct-MLX-6bit specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct-MLX-8bit

LFM2.5-1.2B-Instruct-MLX-8bit specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-1B

Llama-3.2-1B specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-1B-Instruct-FP8

Llama-3.2-1B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-1B-Instruct-FP8-dynamic

Llama-3.2-1B-Instruct-FP8-dynamic specs, VRAM requirements, and which GPUs can run it.

llama-300M-v3-original

llama-300M-v3-original specs, VRAM requirements, and which GPUs can run it.

Nemotron-Flash-3B

Nemotron-Flash-3B specs, VRAM requirements, and which GPUs can run it.

OLMo-1B

OLMo-1B specs, VRAM requirements, and which GPUs can run it.

OLMo-1B-0724-hf

OLMo-1B-0724-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-1B-hf

OLMo-1B-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0425-1B

OLMo-2-0425-1B specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0425-1B-Instruct

OLMo-2-0425-1B-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0425-1B-RLVR1

OLMo-2-0425-1B-RLVR1 specs, VRAM requirements, and which GPUs can run it.

phi-1

phi-1 specs, VRAM requirements, and which GPUs can run it.

phi-1_5

phi-1_5 specs, VRAM requirements, and which GPUs can run it.

phi-2

phi-2 specs, VRAM requirements, and which GPUs can run it.

polyglot-ko-1.3b

polyglot-ko-1.3b specs, VRAM requirements, and which GPUs can run it.

pythia-1.4b

pythia-1.4b specs, VRAM requirements, and which GPUs can run it.

pythia-1.4b-deduped

pythia-1.4b-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-14m

pythia-14m specs, VRAM requirements, and which GPUs can run it.

pythia-14m-deduped

pythia-14m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-160m-deduped

pythia-160m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-160m-seed1

pythia-160m-seed1 specs, VRAM requirements, and which GPUs can run it.

pythia-1b

pythia-1b specs, VRAM requirements, and which GPUs can run it.

pythia-2.8b-deduped

pythia-2.8b-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-31m

pythia-31m specs, VRAM requirements, and which GPUs can run it.

pythia-31m-deduped

pythia-31m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-410m

pythia-410m specs, VRAM requirements, and which GPUs can run it.

pythia-410m-deduped

pythia-410m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-410m-v0

pythia-410m-v0 specs, VRAM requirements, and which GPUs can run it.

pythia-70m-deduped

pythia-70m-deduped specs, VRAM requirements, and which GPUs can run it.

Qwen2-0.5B-Instruct

Qwen2-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2-1.5B-Instruct

Qwen2-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-0.5B

Qwen2.5-0.5B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-0.5B-Instruct

Qwen2.5-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B

Qwen2.5-1.5B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B-Instruct

Qwen2.5-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B-Instruct-AWQ

Qwen2.5-1.5B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B-quantized.w8a8

Qwen2.5-1.5B-quantized.w8a8 specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-0.5B-Instruct

Qwen2.5-Coder-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-1.5B-Instruct

Qwen2.5-Coder-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Math-1.5B

Qwen2.5-Math-1.5B specs, VRAM requirements, and which GPUs can run it.

Qwen3-0.6B

Qwen3-0.6B specs, VRAM requirements, and which GPUs can run it.

Qwen3-0.6B-FP8

Qwen3-0.6B-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-1.7B-Base

Qwen3-1.7B-Base specs, VRAM requirements, and which GPUs can run it.

Qwen3Guard-Gen-0.6B

Qwen3Guard-Gen-0.6B specs, VRAM requirements, and which GPUs can run it.

recurrentgemma-2b

recurrentgemma-2b specs, VRAM requirements, and which GPUs can run it.

SmolLM-135M-Instruct

SmolLM-135M-Instruct specs, VRAM requirements, and which GPUs can run it.

SmolLM2-135M

SmolLM2-135M specs, VRAM requirements, and which GPUs can run it.

SmolLM2-135M-Instruct

SmolLM2-135M-Instruct specs, VRAM requirements, and which GPUs can run it.

stablelm-2-1_6b

stablelm-2-1_6b specs, VRAM requirements, and which GPUs can run it.

stablelm-2-zephyr-1_6b

stablelm-2-zephyr-1_6b specs, VRAM requirements, and which GPUs can run it.

stablelm-3b-4e1t

stablelm-3b-4e1t specs, VRAM requirements, and which GPUs can run it.

stablelm-zephyr-3b

stablelm-zephyr-3b specs, VRAM requirements, and which GPUs can run it.

stories15M_MOE

stories15M_MOE specs, VRAM requirements, and which GPUs can run it.

tiny-random-Gemma2ForCausalLM

tiny-random-Gemma2ForCausalLM specs, VRAM requirements, and which GPUs can run it.

TinyLlama-1.1B-Chat-v0.3-GPTQ

TinyLlama-1.1B-Chat-v0.3-GPTQ specs, VRAM requirements, and which GPUs can run it.

TinyLlama-1.1B-Chat-v1.0

TinyLlama-1.1B-Chat-v1.0 specs, VRAM requirements, and which GPUs can run it.

txgemma-2b-predict

txgemma-2b-predict specs, VRAM requirements, and which GPUs can run it.

vaultgemma-1b

vaultgemma-1b specs, VRAM requirements, and which GPUs can run it.