Large

AceReason-Nemotron-14B

AceReason-Nemotron-14B specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba-Mini-1.5

AI21-Jamba-Mini-1.5 specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba-Mini-1.6

AI21-Jamba-Mini-1.6 specs, VRAM requirements, and which GPUs can run it.

CodeLlama-13b-Instruct-hf

CodeLlama-13b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.

DeepSeek R1 Distill 14B

DeepSeek R1 Distill 14B specs, VRAM requirements, and which GPUs can run it. Reasoning-focused model that punches above its weight.

deepseek-coder-33b-base

deepseek-coder-33b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-33b-instruct

deepseek-coder-33b-instruct specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Lite-Base

DeepSeek-Coder-V2-Lite-Base specs, VRAM requirements, and which GPUs can run it.

deepseek-moe-16b-base

deepseek-moe-16b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-moe-16b-chat

deepseek-moe-16b-chat specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-14B

DeepSeek-R1-Distill-Qwen-14B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-32B

DeepSeek-R1-Distill-Qwen-32B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Lite

DeepSeek-V2-Lite specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Lite-Chat

DeepSeek-V2-Lite-Chat specs, VRAM requirements, and which GPUs can run it.

dolphin-2.9.1-yi-1.5-34b

dolphin-2.9.1-yi-1.5-34b specs, VRAM requirements, and which GPUs can run it.

Dolphin-Mistral-24B-Venice-Edition

Dolphin-Mistral-24B-Venice-Edition specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-34B-Base

Falcon-H1-34B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-34B-Instruct

Falcon-H1-34B-Instruct specs, VRAM requirements, and which GPUs can run it.

gemma-2-27b-it

gemma-2-27b-it specs, VRAM requirements, and which GPUs can run it.

gpt-oss-20b

gpt-oss-20b specs, VRAM requirements, and which GPUs can run it.

Hermes-4-14B

Hermes-4-14B specs, VRAM requirements, and which GPUs can run it.

internlm2-chat-20b

internlm2-chat-20b specs, VRAM requirements, and which GPUs can run it.

LFM2-24B-A2B

LFM2-24B-A2B specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1

Llama-3_3-Nemotron-Super-49B-v1 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1_5-FP8

Llama-3_3-Nemotron-Super-49B-v1_5-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4

Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1-FP8

Llama-3_3-Nemotron-Super-49B-v1-FP8 specs, VRAM requirements, and which GPUs can run it.

Mistral-Small-24B-Instruct-2501-AWQ

Mistral-Small-24B-Instruct-2501-AWQ specs, VRAM requirements, and which GPUs can run it.

Mixtral-8x7B-Instruct-v0.1-GPTQ

Mixtral-8x7B-Instruct-v0.1-GPTQ specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-2-Mixtral-8x7B-DPO

Nous-Hermes-2-Mixtral-8x7B-DPO specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16

NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4

NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0325-32B

OLMo-2-0325-32B specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0325-32B-Instruct

OLMo-2-0325-32B-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMo-2-1124-13B-Instruct

OLMo-2-1124-13B-Instruct specs, VRAM requirements, and which GPUs can run it.

Olmo-3-1125-32B

Olmo-3-1125-32B specs, VRAM requirements, and which GPUs can run it.

Olmo-3-32B-Think

Olmo-3-32B-Think specs, VRAM requirements, and which GPUs can run it.

Olmo-3.1-32B-Think

Olmo-3.1-32B-Think specs, VRAM requirements, and which GPUs can run it.

Phi-3-medium-4k-instruct

Phi-3-medium-4k-instruct specs, VRAM requirements, and which GPUs can run it.

polyglot-ko-12.8b

polyglot-ko-12.8b specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-14B-Instruct-AWQ

Qwen2.5-14B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-32B

Qwen2.5-32B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-32B-Instruct-AWQ

Qwen2.5-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-14B-Instruct

Qwen2.5-Coder-14B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-32B-Instruct

Qwen2.5-Coder-32B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-32B-Instruct-AWQ

Qwen2.5-Coder-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-14B-Instruct

Qwen3-14B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen3-30B-A3B-Instruct-2507-FP8

Qwen3-30B-A3B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-30B-A3B-NVFP4

Qwen3-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-32B-AWQ

Qwen3-32B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-32B-NVFP4

Qwen3-32B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-30B-A3B-Instruct-FP8

Qwen3-Coder-30B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-8bit

Qwen3-Coder-Next-8bit specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-AWQ-4bit

Qwen3-Coder-Next-AWQ-4bit specs, VRAM requirements, and which GPUs can run it.

Qwen3-VL-30B-A3B-Instruct-AWQ

Qwen3-VL-30B-A3B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-27B-Text-NVFP4-MTP

Qwen3.5-27B-Text-NVFP4-MTP specs, VRAM requirements, and which GPUs can run it.

QwQ-32B-AWQ

QwQ-32B-AWQ specs, VRAM requirements, and which GPUs can run it.

StableBeluga-13B

StableBeluga-13B specs, VRAM requirements, and which GPUs can run it.

starchat-alpha

starchat-alpha specs, VRAM requirements, and which GPUs can run it.

Strand-Rust-Coder-14B-v1

Strand-Rust-Coder-14B-v1 specs, VRAM requirements, and which GPUs can run it.

tulu-2-dpo-70b

tulu-2-dpo-70b specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B

Yi-1.5-34B specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-32K

Yi-1.5-34B-32K specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-Chat

Yi-1.5-34B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-Chat-16K

Yi-1.5-34B-Chat-16K specs, VRAM requirements, and which GPUs can run it.