Text-Generation

AceReason-Nemotron-14B

AceReason-Nemotron-14B specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba-Large-1.5

AI21-Jamba-Large-1.5 specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba-Mini-1.5

AI21-Jamba-Mini-1.5 specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba-Mini-1.6

AI21-Jamba-Mini-1.6 specs, VRAM requirements, and which GPUs can run it.

Athene-70B-Preview

Athene-70B-Preview specs, VRAM requirements, and which GPUs can run it.

Athene-V2-Agent

Athene-V2-Agent specs, VRAM requirements, and which GPUs can run it.

bigscience-small-testing

bigscience-small-testing specs, VRAM requirements, and which GPUs can run it.

bitnet-b1.58-2B-4T-bf16

bitnet-b1.58-2B-4T-bf16 specs, VRAM requirements, and which GPUs can run it.

bloom-1b1

bloom-1b1 specs, VRAM requirements, and which GPUs can run it.

bloom-1b7

bloom-1b7 specs, VRAM requirements, and which GPUs can run it.

bloom-3b

bloom-3b specs, VRAM requirements, and which GPUs can run it.

bloom-560m

bloom-560m specs, VRAM requirements, and which GPUs can run it.

bloom-7b1

bloom-7b1 specs, VRAM requirements, and which GPUs can run it.

bloomz

bloomz specs, VRAM requirements, and which GPUs can run it.

bloomz-1b7

bloomz-1b7 specs, VRAM requirements, and which GPUs can run it.

bloomz-3b

bloomz-3b specs, VRAM requirements, and which GPUs can run it.

bloomz-560m

bloomz-560m specs, VRAM requirements, and which GPUs can run it.

bloomz-7b1

bloomz-7b1 specs, VRAM requirements, and which GPUs can run it.

bloomz-7b1-mt

bloomz-7b1-mt specs, VRAM requirements, and which GPUs can run it.

Bolmo-1B

Bolmo-1B specs, VRAM requirements, and which GPUs can run it.

codegemma-2b

codegemma-2b specs, VRAM requirements, and which GPUs can run it.

CodeLlama-13b-Instruct-hf

CodeLlama-13b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.

CodeLlama-7b-Instruct-hf

CodeLlama-7b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.

deep-ignorance-unfiltered

deep-ignorance-unfiltered specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-33b-base

deepseek-coder-33b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-33b-instruct

deepseek-coder-33b-instruct specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-6.7b-base

deepseek-coder-6.7b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-6.7b-instruct

deepseek-coder-6.7b-instruct specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-7b-base-v1.5

deepseek-coder-7b-base-v1.5 specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-7b-instruct-v1.5

deepseek-coder-7b-instruct-v1.5 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Instruct

DeepSeek-Coder-V2-Instruct specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Instruct-0724

DeepSeek-Coder-V2-Instruct-0724 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Lite-Base

DeepSeek-Coder-V2-Lite-Base specs, VRAM requirements, and which GPUs can run it.

deepseek-moe-16b-base

deepseek-moe-16b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-moe-16b-chat

deepseek-moe-16b-chat specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528

DeepSeek-R1-0528 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-NVFP4

DeepSeek-R1-0528-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-NVFP4-v2

DeepSeek-R1-0528-NVFP4-v2 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-Qwen3-8B

DeepSeek-R1-0528-Qwen3-8B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-Qwen3-8B-MLX-4bit

DeepSeek-R1-0528-Qwen3-8B-MLX-4bit specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-Qwen3-8B-MLX-8bit

DeepSeek-R1-0528-Qwen3-8B-MLX-8bit specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-14B

DeepSeek-R1-Distill-Qwen-14B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-32B

DeepSeek-R1-Distill-Qwen-32B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-7B

DeepSeek-R1-Distill-Qwen-7B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-NVFP4

DeepSeek-R1-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2

DeepSeek-V2 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Chat

DeepSeek-V2-Chat specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Chat-0628

DeepSeek-V2-Chat-0628 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Lite

DeepSeek-V2-Lite specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Lite-Chat

DeepSeek-V2-Lite-Chat specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2.5

DeepSeek-V2.5 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3-0324

DeepSeek-V3-0324 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3-0324-NVFP4

DeepSeek-V3-0324-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3.1-NVFP4

DeepSeek-V3.1-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3.2-NVFP4

DeepSeek-V3.2-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DialoGPT-small

DialoGPT-small specs, VRAM requirements, and which GPUs can run it.

distilgpt2

distilgpt2 specs, VRAM requirements, and which GPUs can run it.

dolphin-2.9.1-yi-1.5-34b

dolphin-2.9.1-yi-1.5-34b specs, VRAM requirements, and which GPUs can run it.

Dolphin-Mistral-24B-Venice-Edition

Dolphin-Mistral-24B-Venice-Edition specs, VRAM requirements, and which GPUs can run it.

ELM

ELM specs, VRAM requirements, and which GPUs can run it.

falcon-11B

falcon-11B specs, VRAM requirements, and which GPUs can run it.

falcon-7b-instruct

falcon-7b-instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-0.5B-Base

Falcon-H1-0.5B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-0.5B-Instruct

Falcon-H1-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-1.5B-Base

Falcon-H1-1.5B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-1.5B-Instruct

Falcon-H1-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-34B-Base

Falcon-H1-34B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-34B-Instruct

Falcon-H1-34B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-3B-Base

Falcon-H1-3B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-3B-Instruct

Falcon-H1-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-7B-Base

Falcon-H1-7B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-7B-Instruct

Falcon-H1-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-Tiny-90M-Instruct

Falcon-H1-Tiny-90M-Instruct specs, VRAM requirements, and which GPUs can run it.

falcon-mamba-7b-instruct

falcon-mamba-7b-instruct specs, VRAM requirements, and which GPUs can run it.

falcon-mamba-tiny-dev

falcon-mamba-tiny-dev specs, VRAM requirements, and which GPUs can run it.

Falcon3-10B-Base

Falcon3-10B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon3-1B-Instruct

Falcon3-1B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon3-3B-Base

Falcon3-3B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon3-3B-Instruct

Falcon3-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon3-7B-Base

Falcon3-7B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon3-7B-Instruct

Falcon3-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Flex-reddit-2x7B-1T

Flex-reddit-2x7B-1T specs, VRAM requirements, and which GPUs can run it.

gemma-1.1-2b-it

gemma-1.1-2b-it specs, VRAM requirements, and which GPUs can run it.

gemma-1.1-7b-it

gemma-1.1-7b-it specs, VRAM requirements, and which GPUs can run it.

gemma-2-27b-it

gemma-2-27b-it specs, VRAM requirements, and which GPUs can run it.

gemma-2-9b-it

gemma-2-9b-it specs, VRAM requirements, and which GPUs can run it.

GLM-4.7-Flash-MLX-6bit

GLM-4.7-Flash-MLX-6bit specs, VRAM requirements, and which GPUs can run it.

GLM-4.7-Flash-MLX-8bit

GLM-4.7-Flash-MLX-8bit specs, VRAM requirements, and which GPUs can run it.

gpt-neo-1.3B

gpt-neo-1.3B specs, VRAM requirements, and which GPUs can run it.

gpt-neo-125m

gpt-neo-125m specs, VRAM requirements, and which GPUs can run it.

gpt-neo-2.7B

gpt-neo-2.7B specs, VRAM requirements, and which GPUs can run it.

gpt-oss-120b

gpt-oss-120b specs, VRAM requirements, and which GPUs can run it.

gpt-oss-120b-Eagle3-long-context

gpt-oss-120b-Eagle3-long-context specs, VRAM requirements, and which GPUs can run it.

gpt-oss-20b

gpt-oss-20b specs, VRAM requirements, and which GPUs can run it.

gpt2

gpt2 specs, VRAM requirements, and which GPUs can run it.

gpt2-large

gpt2-large specs, VRAM requirements, and which GPUs can run it.

gpt2-medium

gpt2-medium specs, VRAM requirements, and which GPUs can run it.

gpt2-mini

gpt2-mini specs, VRAM requirements, and which GPUs can run it.

h2ovl-mississippi-2b

h2ovl-mississippi-2b specs, VRAM requirements, and which GPUs can run it.

h2ovl-mississippi-800m

h2ovl-mississippi-800m specs, VRAM requirements, and which GPUs can run it.

Hermes-2-Pro-Llama-3-8B

Hermes-2-Pro-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.

Hermes-2-Pro-Mistral-7B

Hermes-2-Pro-Mistral-7B specs, VRAM requirements, and which GPUs can run it.

Hermes-2-Theta-Llama-3-8B

Hermes-2-Theta-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.

Hermes-3-Llama-3.1-8B

Hermes-3-Llama-3.1-8B specs, VRAM requirements, and which GPUs can run it.

Hermes-4-14B

Hermes-4-14B specs, VRAM requirements, and which GPUs can run it.

internlm2_5-7b

internlm2_5-7b specs, VRAM requirements, and which GPUs can run it.

internlm2-chat-1_8b

internlm2-chat-1_8b specs, VRAM requirements, and which GPUs can run it.

internlm2-chat-20b

internlm2-chat-20b specs, VRAM requirements, and which GPUs can run it.

internlm2-chat-7b-sft

internlm2-chat-7b-sft specs, VRAM requirements, and which GPUs can run it.

Jan-v3-4B-base-instruct

Jan-v3-4B-base-instruct specs, VRAM requirements, and which GPUs can run it.

japanese-gpt-neox-small

japanese-gpt-neox-small specs, VRAM requirements, and which GPUs can run it.

LFM2-24B-A2B

LFM2-24B-A2B specs, VRAM requirements, and which GPUs can run it.

LFM2-8B-A1B

LFM2-8B-A1B specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct

LFM2.5-1.2B-Instruct specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct-MLX-4bit

LFM2.5-1.2B-Instruct-MLX-4bit specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct-MLX-6bit

LFM2.5-1.2B-Instruct-MLX-6bit specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct-MLX-8bit

LFM2.5-1.2B-Instruct-MLX-8bit specs, VRAM requirements, and which GPUs can run it.

Llama-2-7b-hf

Llama-2-7b-hf specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1

Llama-3_3-Nemotron-Super-49B-v1 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1_5-FP8

Llama-3_3-Nemotron-Super-49B-v1_5-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4

Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1-FP8

Llama-3_3-Nemotron-Super-49B-v1-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-405B-Instruct

Llama-3.1-405B-Instruct specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-405B-Instruct-FP8

Llama-3.1-405B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-70B-Instruct

Llama-3.1-70B-Instruct specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-8B-Instruct-FP8

Llama-3.1-8B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-Tulu-3-8B-SFT

Llama-3.1-Tulu-3-8B-SFT specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-1B

Llama-3.2-1B specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-1B-Instruct-FP8

Llama-3.2-1B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-1B-Instruct-FP8-dynamic

Llama-3.2-1B-Instruct-FP8-dynamic specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-3B

Llama-3.2-3B specs, VRAM requirements, and which GPUs can run it.

llama-3.3-70b-instruct-awq

llama-3.3-70b-instruct-awq specs, VRAM requirements, and which GPUs can run it.

llama-300M-v3-original

llama-300M-v3-original specs, VRAM requirements, and which GPUs can run it.

Llama-Guard-3-8B

Llama-Guard-3-8B specs, VRAM requirements, and which GPUs can run it.

Llama-Guard-3-8B-INT8

Llama-Guard-3-8B-INT8 specs, VRAM requirements, and which GPUs can run it.

LlamaGuard-7b

LlamaGuard-7b specs, VRAM requirements, and which GPUs can run it.

llm-jp-3-3.7b-instruct

llm-jp-3-3.7b-instruct specs, VRAM requirements, and which GPUs can run it.

LocoOperator-4B

LocoOperator-4B specs, VRAM requirements, and which GPUs can run it.

maira-2

maira-2 specs, VRAM requirements, and which GPUs can run it.

MediPhi-Clinical

MediPhi-Clinical specs, VRAM requirements, and which GPUs can run it.

MediPhi-Instruct

MediPhi-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3-70B-Instruct

Meta-Llama-3-70B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3-8B

Meta-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3-8B-Instruct

Meta-Llama-3-8B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-70B-Instruct

Meta-Llama-3.1-70B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B

Meta-Llama-3.1-8B specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B-Instruct

Meta-Llama-3.1-8B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B-Instruct-bnb-4bit

Meta-Llama-3.1-8B-Instruct-bnb-4bit specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B-Instruct-FP8

Meta-Llama-3.1-8B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-Guard-2-8B

Meta-Llama-Guard-2-8B specs, VRAM requirements, and which GPUs can run it.

MiniMax-M2-AWQ

MiniMax-M2-AWQ specs, VRAM requirements, and which GPUs can run it.

MiniMax-M2.5

MiniMax-M2.5 specs, VRAM requirements, and which GPUs can run it.

Mistral-7B-Instruct-v0.2

Mistral-7B-Instruct-v0.2 specs, VRAM requirements, and which GPUs can run it.

mistral-7b-v0.3-bnb-4bit

mistral-7b-v0.3-bnb-4bit specs, VRAM requirements, and which GPUs can run it.

Mistral-NeMo-Minitron-8B-Instruct

Mistral-NeMo-Minitron-8B-Instruct specs, VRAM requirements, and which GPUs can run it.

Mistral-Small-24B-Instruct-2501-AWQ

Mistral-Small-24B-Instruct-2501-AWQ specs, VRAM requirements, and which GPUs can run it.

Mixtral-8x7B-Instruct-v0.1-GPTQ

Mixtral-8x7B-Instruct-v0.1-GPTQ specs, VRAM requirements, and which GPUs can run it.

Nanbeige4.1-3B

Nanbeige4.1-3B specs, VRAM requirements, and which GPUs can run it.

Nanbeige4.1-3B-heretic

Nanbeige4.1-3B-heretic specs, VRAM requirements, and which GPUs can run it.

Nemotron-Flash-3B

Nemotron-Flash-3B specs, VRAM requirements, and which GPUs can run it.

Nemotron-H-4B-Base-8K

Nemotron-H-4B-Base-8K specs, VRAM requirements, and which GPUs can run it.

Nemotron-H-4B-Instruct-128K

Nemotron-H-4B-Instruct-128K specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-2-Mistral-7B-DPO

Nous-Hermes-2-Mistral-7B-DPO specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-2-Mixtral-8x7B-DPO

Nous-Hermes-2-Mixtral-8x7B-DPO specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-2-SOLAR-10.7B

Nous-Hermes-2-SOLAR-10.7B specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-llama-2-7b

Nous-Hermes-llama-2-7b specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16

NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4

NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2

NVIDIA-Nemotron-Nano-9B-v2 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2-Base

NVIDIA-Nemotron-Nano-9B-v2-Base specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2-FP8

NVIDIA-Nemotron-Nano-9B-v2-FP8 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2-Japanese

NVIDIA-Nemotron-Nano-9B-v2-Japanese specs, VRAM requirements, and which GPUs can run it.

OLMo-1B

OLMo-1B specs, VRAM requirements, and which GPUs can run it.

OLMo-1B-0724-hf

OLMo-1B-0724-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-1B-hf

OLMo-1B-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0325-32B

OLMo-2-0325-32B specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0325-32B-Instruct

OLMo-2-0325-32B-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0425-1B

OLMo-2-0425-1B specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0425-1B-Instruct

OLMo-2-0425-1B-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0425-1B-RLVR1

OLMo-2-0425-1B-RLVR1 specs, VRAM requirements, and which GPUs can run it.

OLMo-2-1124-13B-Instruct

OLMo-2-1124-13B-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMo-2-1124-7B-Instruct

OLMo-2-1124-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Olmo-3-1025-7B

Olmo-3-1025-7B specs, VRAM requirements, and which GPUs can run it.

Olmo-3-1125-32B

Olmo-3-1125-32B specs, VRAM requirements, and which GPUs can run it.

Olmo-3-32B-Think

Olmo-3-32B-Think specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Instruct

Olmo-3-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Instruct-DPO

Olmo-3-7B-Instruct-DPO specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Instruct-SFT

Olmo-3-7B-Instruct-SFT specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Think

Olmo-3-7B-Think specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Think-DPO

Olmo-3-7B-Think-DPO specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Think-SFT

Olmo-3-7B-Think-SFT specs, VRAM requirements, and which GPUs can run it.

Olmo-3.1-32B-Think

Olmo-3.1-32B-Think specs, VRAM requirements, and which GPUs can run it.

Olmo-3.1-7B-RL-Zero-Math

Olmo-3.1-7B-RL-Zero-Math specs, VRAM requirements, and which GPUs can run it.

OLMo-7B-0724-hf

OLMo-7B-0724-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-7B-hf

OLMo-7B-hf specs, VRAM requirements, and which GPUs can run it.

Olmo-Hybrid-Instruct-DPO-7B

Olmo-Hybrid-Instruct-DPO-7B specs, VRAM requirements, and which GPUs can run it.

OLMoE-1B-7B-0125

OLMoE-1B-7B-0125 specs, VRAM requirements, and which GPUs can run it.

OLMoE-1B-7B-0125-Instruct

OLMoE-1B-7B-0125-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMoE-1B-7B-0924-Instruct

OLMoE-1B-7B-0924-Instruct specs, VRAM requirements, and which GPUs can run it.

phi-1

phi-1 specs, VRAM requirements, and which GPUs can run it.

phi-1_5

phi-1_5 specs, VRAM requirements, and which GPUs can run it.

phi-2

phi-2 specs, VRAM requirements, and which GPUs can run it.

Phi-3-medium-4k-instruct

Phi-3-medium-4k-instruct specs, VRAM requirements, and which GPUs can run it.

Phi-3-mini-4k-instruct-gptq-4bit

Phi-3-mini-4k-instruct-gptq-4bit specs, VRAM requirements, and which GPUs can run it.

Phi-3-small-8k-instruct

Phi-3-small-8k-instruct specs, VRAM requirements, and which GPUs can run it.

Phi-mini-MoE-instruct

Phi-mini-MoE-instruct specs, VRAM requirements, and which GPUs can run it.

Phi-tiny-MoE-instruct

Phi-tiny-MoE-instruct specs, VRAM requirements, and which GPUs can run it.

polyglot-ko-1.3b

polyglot-ko-1.3b specs, VRAM requirements, and which GPUs can run it.

polyglot-ko-12.8b

polyglot-ko-12.8b specs, VRAM requirements, and which GPUs can run it.

polyglot-ko-5.8b

polyglot-ko-5.8b specs, VRAM requirements, and which GPUs can run it.

pythia-1.4b

pythia-1.4b specs, VRAM requirements, and which GPUs can run it.

pythia-1.4b-deduped

pythia-1.4b-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-12b

pythia-12b specs, VRAM requirements, and which GPUs can run it.

pythia-14m

pythia-14m specs, VRAM requirements, and which GPUs can run it.

pythia-14m-deduped

pythia-14m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-160m-deduped

pythia-160m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-160m-seed1

pythia-160m-seed1 specs, VRAM requirements, and which GPUs can run it.

pythia-1b

pythia-1b specs, VRAM requirements, and which GPUs can run it.

pythia-2.8b-deduped

pythia-2.8b-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-31m

pythia-31m specs, VRAM requirements, and which GPUs can run it.

pythia-31m-deduped

pythia-31m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-410m

pythia-410m specs, VRAM requirements, and which GPUs can run it.

pythia-410m-deduped

pythia-410m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-410m-v0

pythia-410m-v0 specs, VRAM requirements, and which GPUs can run it.

pythia-6.9b

pythia-6.9b specs, VRAM requirements, and which GPUs can run it.

pythia-70m-deduped

pythia-70m-deduped specs, VRAM requirements, and which GPUs can run it.

Qwen1.5-110B-Chat-AWQ

Qwen1.5-110B-Chat-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2-0.5B-Instruct

Qwen2-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2-1.5B-Instruct

Qwen2-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2-7B-Instruct

Qwen2-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-0.5B

Qwen2.5-0.5B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-0.5B-Instruct

Qwen2.5-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B

Qwen2.5-1.5B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B-Instruct

Qwen2.5-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B-Instruct-AWQ

Qwen2.5-1.5B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B-quantized.w8a8

Qwen2.5-1.5B-quantized.w8a8 specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-14B-Instruct-AWQ

Qwen2.5-14B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-32B

Qwen2.5-32B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-32B-Instruct-AWQ

Qwen2.5-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-3B

Qwen2.5-3B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-3B-Instruct

Qwen2.5-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-72B-Instruct

Qwen2.5-72B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-72B-Instruct-AWQ

Qwen2.5-72B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-7B

Qwen2.5-7B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-7B-Instruct

Qwen2.5-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-0.5B-Instruct

Qwen2.5-Coder-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-1.5B-Instruct

Qwen2.5-Coder-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-14B-Instruct

Qwen2.5-Coder-14B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-32B-Instruct

Qwen2.5-Coder-32B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-32B-Instruct-AWQ

Qwen2.5-Coder-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-7B-Instruct

Qwen2.5-Coder-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-7B-Instruct-AWQ

Qwen2.5-Coder-7B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-7B-Instruct-GPTQ-Int4

Qwen2.5-Coder-7B-Instruct-GPTQ-Int4 specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Math-1.5B

Qwen2.5-Math-1.5B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-VL-7B-Instruct-NVFP4

Qwen2.5-VL-7B-Instruct-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-0.6B

Qwen3-0.6B specs, VRAM requirements, and which GPUs can run it.

Qwen3-0.6B-FP8

Qwen3-0.6B-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-1.7B-Base

Qwen3-1.7B-Base specs, VRAM requirements, and which GPUs can run it.

Qwen3-14B-Instruct

Qwen3-14B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen3-14B-NVFP4

Qwen3-14B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-235B-A22B

Qwen3-235B-A22B specs, VRAM requirements, and which GPUs can run it.

Qwen3-235B-A22B-Instruct-2507-FP8

Qwen3-235B-A22B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-235B-A22B-NVFP4

Qwen3-235B-A22B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-30B-A3B-Instruct-2507-FP8

Qwen3-30B-A3B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-30B-A3B-NVFP4

Qwen3-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-32B-AWQ

Qwen3-32B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-32B-NVFP4

Qwen3-32B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-AWQ

Qwen3-4B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-Instruct-2507-FP8

Qwen3-4B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-SafeRL

Qwen3-4B-SafeRL specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-AWQ

Qwen3-8B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-Base

Qwen3-8B-Base specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-FP8

Qwen3-8B-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-NVFP4

Qwen3-8B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-30B-A3B-Instruct-FP8

Qwen3-Coder-30B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next

Qwen3-Coder-Next specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-8bit

Qwen3-Coder-Next-8bit specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-AWQ-4bit

Qwen3-Coder-Next-AWQ-4bit specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-Base

Qwen3-Coder-Next-Base specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-FP8

Qwen3-Coder-Next-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Next-80B-A3B-Instruct

Qwen3-Next-80B-A3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen3-Next-80B-A3B-Instruct-FP8

Qwen3-Next-80B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-VL-30B-A3B-Instruct-AWQ

Qwen3-VL-30B-A3B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-27B-Text-NVFP4-MTP

Qwen3.5-27B-Text-NVFP4-MTP specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-4B-Safety-Thinking

Qwen3.5-4B-Safety-Thinking specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-9B-abliterated

Qwen3.5-9B-abliterated specs, VRAM requirements, and which GPUs can run it.

Qwen3Guard-Gen-0.6B

Qwen3Guard-Gen-0.6B specs, VRAM requirements, and which GPUs can run it.

Qwen3Guard-Gen-4B

Qwen3Guard-Gen-4B specs, VRAM requirements, and which GPUs can run it.

Qwen3Guard-Gen-8B

Qwen3Guard-Gen-8B specs, VRAM requirements, and which GPUs can run it.

QwQ-32B-AWQ

QwQ-32B-AWQ specs, VRAM requirements, and which GPUs can run it.

recurrentgemma-2b

recurrentgemma-2b specs, VRAM requirements, and which GPUs can run it.

saiga_llama3_8b

saiga_llama3_8b specs, VRAM requirements, and which GPUs can run it.

SmolLM-135M-Instruct

SmolLM-135M-Instruct specs, VRAM requirements, and which GPUs can run it.

SmolLM2-135M

SmolLM2-135M specs, VRAM requirements, and which GPUs can run it.

SmolLM2-135M-Instruct

SmolLM2-135M-Instruct specs, VRAM requirements, and which GPUs can run it.

SOLAR-10.7B-v1.0

SOLAR-10.7B-v1.0 specs, VRAM requirements, and which GPUs can run it.

StableBeluga-13B

StableBeluga-13B specs, VRAM requirements, and which GPUs can run it.

stablelm-2-1_6b

stablelm-2-1_6b specs, VRAM requirements, and which GPUs can run it.

stablelm-2-zephyr-1_6b

stablelm-2-zephyr-1_6b specs, VRAM requirements, and which GPUs can run it.

stablelm-3b-4e1t

stablelm-3b-4e1t specs, VRAM requirements, and which GPUs can run it.

stablelm-base-alpha-7b-v2

stablelm-base-alpha-7b-v2 specs, VRAM requirements, and which GPUs can run it.

stablelm-zephyr-3b

stablelm-zephyr-3b specs, VRAM requirements, and which GPUs can run it.

starchat-alpha

starchat-alpha specs, VRAM requirements, and which GPUs can run it.

Starling-LM-7B-beta

Starling-LM-7B-beta specs, VRAM requirements, and which GPUs can run it.

steerling-8b

steerling-8b specs, VRAM requirements, and which GPUs can run it.

Step-3.5-Flash

Step-3.5-Flash specs, VRAM requirements, and which GPUs can run it.

stories15M_MOE

stories15M_MOE specs, VRAM requirements, and which GPUs can run it.

Strand-Rust-Coder-14B-v1

Strand-Rust-Coder-14B-v1 specs, VRAM requirements, and which GPUs can run it.

tiny-aya-global

tiny-aya-global specs, VRAM requirements, and which GPUs can run it.

tiny-random-Gemma2ForCausalLM

tiny-random-Gemma2ForCausalLM specs, VRAM requirements, and which GPUs can run it.

TinyLlama-1.1B-Chat-v0.3-GPTQ

TinyLlama-1.1B-Chat-v0.3-GPTQ specs, VRAM requirements, and which GPUs can run it.

TinyLlama-1.1B-Chat-v1.0

TinyLlama-1.1B-Chat-v1.0 specs, VRAM requirements, and which GPUs can run it.

tulu-2-dpo-70b

tulu-2-dpo-70b specs, VRAM requirements, and which GPUs can run it.

txgemma-2b-predict

txgemma-2b-predict specs, VRAM requirements, and which GPUs can run it.

vaultgemma-1b

vaultgemma-1b specs, VRAM requirements, and which GPUs can run it.

wildguard

wildguard specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B

Yi-1.5-34B specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-32K

Yi-1.5-34B-32K specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-Chat

Yi-1.5-34B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-Chat-16K

Yi-1.5-34B-Chat-16K specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-6B

Yi-1.5-6B specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-6B-Chat

Yi-1.5-6B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B

Yi-1.5-9B specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B-32K

Yi-1.5-9B-32K specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B-Chat

Yi-1.5-9B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B-Chat-16K

Yi-1.5-9B-Chat-16K specs, VRAM requirements, and which GPUs can run it.

Yi-6B

Yi-6B specs, VRAM requirements, and which GPUs can run it.

Yi-6B-200K

Yi-6B-200K specs, VRAM requirements, and which GPUs can run it.

Yi-6B-Chat

Yi-6B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-9B

Yi-9B specs, VRAM requirements, and which GPUs can run it.

Yi-9B-200K

Yi-9B-200K specs, VRAM requirements, and which GPUs can run it.

Yi-Coder-9B

Yi-Coder-9B specs, VRAM requirements, and which GPUs can run it.

Yi-Coder-9B-Chat

Yi-Coder-9B-Chat specs, VRAM requirements, and which GPUs can run it.

zephyr-7b-beta

zephyr-7b-beta specs, VRAM requirements, and which GPUs can run it.