Llm
AceReason-Nemotron-14B specs, VRAM requirements, and which GPUs can run it.
AI21-Jamba-Large-1.5 specs, VRAM requirements, and which GPUs can run it.
AI21-Jamba-Mini-1.5 specs, VRAM requirements, and which GPUs can run it.
AI21-Jamba-Mini-1.6 specs, VRAM requirements, and which GPUs can run it.
Athene-70B-Preview specs, VRAM requirements, and which GPUs can run it.
Athene-V2-Agent specs, VRAM requirements, and which GPUs can run it.
bigscience-small-testing specs, VRAM requirements, and which GPUs can run it.
bitnet-b1.58-2B-4T-bf16 specs, VRAM requirements, and which GPUs can run it.
bloom-1b1 specs, VRAM requirements, and which GPUs can run it.
bloom-1b7 specs, VRAM requirements, and which GPUs can run it.
bloom-3b specs, VRAM requirements, and which GPUs can run it.
bloom-560m specs, VRAM requirements, and which GPUs can run it.
bloom-7b1 specs, VRAM requirements, and which GPUs can run it.
bloomz specs, VRAM requirements, and which GPUs can run it.
bloomz-1b7 specs, VRAM requirements, and which GPUs can run it.
bloomz-3b specs, VRAM requirements, and which GPUs can run it.
bloomz-560m specs, VRAM requirements, and which GPUs can run it.
bloomz-7b1 specs, VRAM requirements, and which GPUs can run it.
bloomz-7b1-mt specs, VRAM requirements, and which GPUs can run it.
Bolmo-1B specs, VRAM requirements, and which GPUs can run it.
codegemma-2b specs, VRAM requirements, and which GPUs can run it.
CodeLlama-13b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.
CodeLlama-7b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.
deep-ignorance-unfiltered specs, VRAM requirements, and which GPUs can run it.
DeepSeek R1 Distill 14B specs, VRAM requirements, and which GPUs can run it. Reasoning-focused model that punches above its weight.
deepseek-coder-33b-base specs, VRAM requirements, and which GPUs can run it.
deepseek-coder-33b-instruct specs, VRAM requirements, and which GPUs can run it.
deepseek-coder-6.7b-base specs, VRAM requirements, and which GPUs can run it.
deepseek-coder-6.7b-instruct specs, VRAM requirements, and which GPUs can run it.
deepseek-coder-7b-base-v1.5 specs, VRAM requirements, and which GPUs can run it.
deepseek-coder-7b-instruct-v1.5 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-Coder-V2-Instruct specs, VRAM requirements, and which GPUs can run it.
DeepSeek-Coder-V2-Instruct-0724 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-Coder-V2-Lite-Base specs, VRAM requirements, and which GPUs can run it.
deepseek-moe-16b-base specs, VRAM requirements, and which GPUs can run it.
deepseek-moe-16b-chat specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-0528 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-0528-NVFP4 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-0528-NVFP4-v2 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-0528-Qwen3-8B specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-0528-Qwen3-8B-MLX-4bit specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-0528-Qwen3-8B-MLX-8bit specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-Distill-Qwen-14B specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-Distill-Qwen-32B specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-Distill-Qwen-7B specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-NVFP4 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V2 specs, VRAM requirements, and which GPUs can run it.
Deepseek-V2 Pro specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V2-Chat specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V2-Chat-0628 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V2-Lite specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V2-Lite-Chat specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V2.5 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V3-0324 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V3-0324-NVFP4 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V3.1-NVFP4 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V3.2 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V3.2-NVFP4 specs, VRAM requirements, and which GPUs can run it.
DialoGPT-small specs, VRAM requirements, and which GPUs can run it.
distilgpt2 specs, VRAM requirements, and which GPUs can run it.
dolphin-2.9.1-yi-1.5-34b specs, VRAM requirements, and which GPUs can run it.
Dolphin-Mistral-24B-Venice-Edition specs, VRAM requirements, and which GPUs can run it.
ELM specs, VRAM requirements, and which GPUs can run it.
falcon-11B specs, VRAM requirements, and which GPUs can run it.
falcon-7b-instruct specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-0.5B-Base specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-1.5B-Base specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-34B-Base specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-34B-Instruct specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-3B-Base specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-3B-Instruct specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-7B-Base specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-7B-Instruct specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-Tiny-90M-Instruct specs, VRAM requirements, and which GPUs can run it.
falcon-mamba-7b-instruct specs, VRAM requirements, and which GPUs can run it.
falcon-mamba-tiny-dev specs, VRAM requirements, and which GPUs can run it.
Falcon3-10B-Base specs, VRAM requirements, and which GPUs can run it.
Falcon3-1B-Instruct specs, VRAM requirements, and which GPUs can run it.
Falcon3-3B-Base specs, VRAM requirements, and which GPUs can run it.
Falcon3-3B-Instruct specs, VRAM requirements, and which GPUs can run it.
Falcon3-7B-Base specs, VRAM requirements, and which GPUs can run it.
Falcon3-7B-Instruct specs, VRAM requirements, and which GPUs can run it.
Flex-reddit-2x7B-1T specs, VRAM requirements, and which GPUs can run it.
gemma-1.1-2b-it specs, VRAM requirements, and which GPUs can run it.
gemma-1.1-7b-it specs, VRAM requirements, and which GPUs can run it.
gemma-2-27b-it specs, VRAM requirements, and which GPUs can run it.
gemma-2-9b-it specs, VRAM requirements, and which GPUs can run it.
GLM-4.7-Flash-MLX-6bit specs, VRAM requirements, and which GPUs can run it.
GLM-4.7-Flash-MLX-8bit specs, VRAM requirements, and which GPUs can run it.
gpt-neo-1.3B specs, VRAM requirements, and which GPUs can run it.
gpt-neo-125m specs, VRAM requirements, and which GPUs can run it.
gpt-neo-2.7B specs, VRAM requirements, and which GPUs can run it.
gpt-oss-120b specs, VRAM requirements, and which GPUs can run it.
gpt-oss-120b-Eagle3-long-context specs, VRAM requirements, and which GPUs can run it.
gpt-oss-20b specs, VRAM requirements, and which GPUs can run it.
gpt2 specs, VRAM requirements, and which GPUs can run it.
gpt2-large specs, VRAM requirements, and which GPUs can run it.
gpt2-medium specs, VRAM requirements, and which GPUs can run it.
gpt2-mini specs, VRAM requirements, and which GPUs can run it.
h2ovl-mississippi-2b specs, VRAM requirements, and which GPUs can run it.
h2ovl-mississippi-800m specs, VRAM requirements, and which GPUs can run it.
Hermes-2-Pro-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.
Hermes-2-Pro-Mistral-7B specs, VRAM requirements, and which GPUs can run it.
Hermes-2-Theta-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.
Hermes-3-Llama-3.1-8B specs, VRAM requirements, and which GPUs can run it.
Hermes-4-14B specs, VRAM requirements, and which GPUs can run it.
internlm2_5-7b specs, VRAM requirements, and which GPUs can run it.
internlm2-chat-1_8b specs, VRAM requirements, and which GPUs can run it.
internlm2-chat-20b specs, VRAM requirements, and which GPUs can run it.
internlm2-chat-7b-sft specs, VRAM requirements, and which GPUs can run it.
Jan-v3-4B-base-instruct specs, VRAM requirements, and which GPUs can run it.
japanese-gpt-neox-small specs, VRAM requirements, and which GPUs can run it.
LFM2-24B-A2B specs, VRAM requirements, and which GPUs can run it.
LFM2-8B-A1B specs, VRAM requirements, and which GPUs can run it.
LFM2.5-1.2B-Instruct specs, VRAM requirements, and which GPUs can run it.
LFM2.5-1.2B-Instruct-MLX-4bit specs, VRAM requirements, and which GPUs can run it.
LFM2.5-1.2B-Instruct-MLX-6bit specs, VRAM requirements, and which GPUs can run it.
LFM2.5-1.2B-Instruct-MLX-8bit specs, VRAM requirements, and which GPUs can run it.
Llama 3.1 70B specs, VRAM requirements, and which GPUs can run it. The sweet spot for local reasoning.
Llama 3.1 8B specs, VRAM requirements, and which GPUs can run it. The go-to small model for local inference.
Llama-2-7b-hf specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1 specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1_5-FP8 specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1-FP8 specs, VRAM requirements, and which GPUs can run it.
Llama-3.1-405B-Instruct specs, VRAM requirements, and which GPUs can run it.
Llama-3.1-405B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
Llama-3.1-70B-Instruct specs, VRAM requirements, and which GPUs can run it.
Llama-3.1-8B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
Llama-3.1-Tulu-3-8B-SFT specs, VRAM requirements, and which GPUs can run it.
Llama-3.2-1B specs, VRAM requirements, and which GPUs can run it.
Llama-3.2-1B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
Llama-3.2-1B-Instruct-FP8-dynamic specs, VRAM requirements, and which GPUs can run it.
Llama-3.2-3B specs, VRAM requirements, and which GPUs can run it.
llama-3.3-70b-instruct-awq specs, VRAM requirements, and which GPUs can run it.
llama-300M-v3-original specs, VRAM requirements, and which GPUs can run it.
Llama-Guard-3-8B specs, VRAM requirements, and which GPUs can run it.
Llama-Guard-3-8B-INT8 specs, VRAM requirements, and which GPUs can run it.
LlamaGuard-7b specs, VRAM requirements, and which GPUs can run it.
llm-jp-3-3.7b-instruct specs, VRAM requirements, and which GPUs can run it.
LocoOperator-4B specs, VRAM requirements, and which GPUs can run it.
maira-2 specs, VRAM requirements, and which GPUs can run it.
MediPhi-Clinical specs, VRAM requirements, and which GPUs can run it.
MediPhi-Instruct specs, VRAM requirements, and which GPUs can run it.
Meta-Llama-3-70B-Instruct specs, VRAM requirements, and which GPUs can run it.
Meta-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.
Meta-Llama-3-8B-Instruct specs, VRAM requirements, and which GPUs can run it.
Meta-Llama-3.1-70B-Instruct specs, VRAM requirements, and which GPUs can run it.
Meta-Llama-3.1-8B specs, VRAM requirements, and which GPUs can run it.
Meta-Llama-3.1-8B-Instruct specs, VRAM requirements, and which GPUs can run it.
Meta-Llama-3.1-8B-Instruct-bnb-4bit specs, VRAM requirements, and which GPUs can run it.
Meta-Llama-3.1-8B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
Meta-Llama-Guard-2-8B specs, VRAM requirements, and which GPUs can run it.
MiniMax-M2-AWQ specs, VRAM requirements, and which GPUs can run it.
MiniMax-M2.5 specs, VRAM requirements, and which GPUs can run it.
Mistral 7B specs, VRAM requirements, and which GPUs can run it. Efficient and fast for everyday tasks.
Mistral-7B-Instruct-v0.2 specs, VRAM requirements, and which GPUs can run it.
mistral-7b-v0.3-bnb-4bit specs, VRAM requirements, and which GPUs can run it.
Mistral-NeMo-Minitron-8B-Instruct specs, VRAM requirements, and which GPUs can run it.
Mistral-Small-24B-Instruct-2501-AWQ specs, VRAM requirements, and which GPUs can run it.
Mixtral-8x7B-Instruct-v0.1-GPTQ specs, VRAM requirements, and which GPUs can run it.
Nanbeige4.1-3B specs, VRAM requirements, and which GPUs can run it.
Nanbeige4.1-3B-heretic specs, VRAM requirements, and which GPUs can run it.
Nemotron-Flash-3B specs, VRAM requirements, and which GPUs can run it.
Nemotron-H-4B-Base-8K specs, VRAM requirements, and which GPUs can run it.
Nemotron-H-4B-Instruct-128K specs, VRAM requirements, and which GPUs can run it.
Nous-Hermes-2-Mistral-7B-DPO specs, VRAM requirements, and which GPUs can run it.
Nous-Hermes-2-Mixtral-8x7B-DPO specs, VRAM requirements, and which GPUs can run it.
Nous-Hermes-2-SOLAR-10.7B specs, VRAM requirements, and which GPUs can run it.
Nous-Hermes-llama-2-7b specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-Nano-9B-v2 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-Nano-9B-v2-Base specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-Nano-9B-v2-FP8 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-Nano-9B-v2-Japanese specs, VRAM requirements, and which GPUs can run it.
OLMo-1B specs, VRAM requirements, and which GPUs can run it.
OLMo-1B-0724-hf specs, VRAM requirements, and which GPUs can run it.
OLMo-1B-hf specs, VRAM requirements, and which GPUs can run it.
OLMo-2-0325-32B specs, VRAM requirements, and which GPUs can run it.
OLMo-2-0325-32B-Instruct specs, VRAM requirements, and which GPUs can run it.
OLMo-2-0425-1B specs, VRAM requirements, and which GPUs can run it.
OLMo-2-0425-1B-Instruct specs, VRAM requirements, and which GPUs can run it.
OLMo-2-0425-1B-RLVR1 specs, VRAM requirements, and which GPUs can run it.
OLMo-2-1124-13B-Instruct specs, VRAM requirements, and which GPUs can run it.
OLMo-2-1124-7B-Instruct specs, VRAM requirements, and which GPUs can run it.
Olmo-3-1025-7B specs, VRAM requirements, and which GPUs can run it.
Olmo-3-1125-32B specs, VRAM requirements, and which GPUs can run it.
Olmo-3-32B-Think specs, VRAM requirements, and which GPUs can run it.
Olmo-3-7B-Instruct specs, VRAM requirements, and which GPUs can run it.
Olmo-3-7B-Instruct-DPO specs, VRAM requirements, and which GPUs can run it.
Olmo-3-7B-Instruct-SFT specs, VRAM requirements, and which GPUs can run it.
Olmo-3-7B-Think specs, VRAM requirements, and which GPUs can run it.
Olmo-3-7B-Think-DPO specs, VRAM requirements, and which GPUs can run it.
Olmo-3-7B-Think-SFT specs, VRAM requirements, and which GPUs can run it.
Olmo-3.1-32B-Think specs, VRAM requirements, and which GPUs can run it.
Olmo-3.1-7B-RL-Zero-Math specs, VRAM requirements, and which GPUs can run it.
OLMo-7B-0724-hf specs, VRAM requirements, and which GPUs can run it.
OLMo-7B-hf specs, VRAM requirements, and which GPUs can run it.
Olmo-Hybrid-Instruct-DPO-7B specs, VRAM requirements, and which GPUs can run it.
OLMoE-1B-7B-0125 specs, VRAM requirements, and which GPUs can run it.
OLMoE-1B-7B-0125-Instruct specs, VRAM requirements, and which GPUs can run it.
OLMoE-1B-7B-0924-Instruct specs, VRAM requirements, and which GPUs can run it.
phi-1 specs, VRAM requirements, and which GPUs can run it.
phi-1_5 specs, VRAM requirements, and which GPUs can run it.
phi-2 specs, VRAM requirements, and which GPUs can run it.
Phi-3-medium-4k-instruct specs, VRAM requirements, and which GPUs can run it.
Phi-3-mini-4k-instruct-gptq-4bit specs, VRAM requirements, and which GPUs can run it.
Phi-3-small-8k-instruct specs, VRAM requirements, and which GPUs can run it.
Phi-mini-MoE-instruct specs, VRAM requirements, and which GPUs can run it.
Phi-tiny-MoE-instruct specs, VRAM requirements, and which GPUs can run it.
polyglot-ko-1.3b specs, VRAM requirements, and which GPUs can run it.
polyglot-ko-12.8b specs, VRAM requirements, and which GPUs can run it.
polyglot-ko-5.8b specs, VRAM requirements, and which GPUs can run it.
pythia-1.4b specs, VRAM requirements, and which GPUs can run it.
pythia-1.4b-deduped specs, VRAM requirements, and which GPUs can run it.
pythia-12b specs, VRAM requirements, and which GPUs can run it.
pythia-14m specs, VRAM requirements, and which GPUs can run it.
pythia-14m-deduped specs, VRAM requirements, and which GPUs can run it.
pythia-160m-deduped specs, VRAM requirements, and which GPUs can run it.
pythia-160m-seed1 specs, VRAM requirements, and which GPUs can run it.
pythia-1b specs, VRAM requirements, and which GPUs can run it.
pythia-2.8b-deduped specs, VRAM requirements, and which GPUs can run it.
pythia-31m specs, VRAM requirements, and which GPUs can run it.
pythia-31m-deduped specs, VRAM requirements, and which GPUs can run it.
pythia-410m specs, VRAM requirements, and which GPUs can run it.
pythia-410m-deduped specs, VRAM requirements, and which GPUs can run it.
pythia-410m-v0 specs, VRAM requirements, and which GPUs can run it.
pythia-6.9b specs, VRAM requirements, and which GPUs can run it.
pythia-70m-deduped specs, VRAM requirements, and which GPUs can run it.
Qwen 2.5 72B specs, VRAM requirements, and which GPUs can run it. Strong on benchmarks, competitive with Llama 70B.
Qwen 2.5 72B Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen1.5-110B-Chat-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2 72B specs, VRAM requirements, and which GPUs can run it.
Qwen2-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2-7B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-0.5B specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-1.5B specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-1.5B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-1.5B-quantized.w8a8 specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-14B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-32B specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-3B specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-3B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-72B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-72B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-7B specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-7B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-14B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-32B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-7B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-7B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-7B-Instruct-GPTQ-Int4 specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Math-1.5B specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-VL-7B-Instruct-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-0.6B specs, VRAM requirements, and which GPUs can run it.
Qwen3-0.6B-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-1.7B-Base specs, VRAM requirements, and which GPUs can run it.
Qwen3-14B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen3-14B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-235B-A22B specs, VRAM requirements, and which GPUs can run it.
Qwen3-235B-A22B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-235B-A22B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-30B-A3B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-32B-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3-32B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-4B-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3-4B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-4B-SafeRL specs, VRAM requirements, and which GPUs can run it.
Qwen3-8B-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3-8B-Base specs, VRAM requirements, and which GPUs can run it.
Qwen3-8B-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-8B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-30B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-Next specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-Next-8bit specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-Next-AWQ-4bit specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-Next-Base specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-Next-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-Next-80B-A3B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen3-Next-80B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-VL-30B-A3B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled specs, VRAM requirements, and which GPUs can run it.
Qwen3.5-27B-Text-NVFP4-MTP specs, VRAM requirements, and which GPUs can run it.
Qwen3.5-4B-Safety-Thinking specs, VRAM requirements, and which GPUs can run it.
Qwen3.5-9B-abliterated specs, VRAM requirements, and which GPUs can run it.
Qwen3Guard-Gen-0.6B specs, VRAM requirements, and which GPUs can run it.
Qwen3Guard-Gen-4B specs, VRAM requirements, and which GPUs can run it.
Qwen3Guard-Gen-8B specs, VRAM requirements, and which GPUs can run it.
QwQ-32B-AWQ specs, VRAM requirements, and which GPUs can run it.
recurrentgemma-2b specs, VRAM requirements, and which GPUs can run it.
saiga_llama3_8b specs, VRAM requirements, and which GPUs can run it.
SmolLM-135M-Instruct specs, VRAM requirements, and which GPUs can run it.
SmolLM2-135M specs, VRAM requirements, and which GPUs can run it.
SmolLM2-135M-Instruct specs, VRAM requirements, and which GPUs can run it.
SOLAR-10.7B-v1.0 specs, VRAM requirements, and which GPUs can run it.
StableBeluga-13B specs, VRAM requirements, and which GPUs can run it.
stablelm-2-1_6b specs, VRAM requirements, and which GPUs can run it.
stablelm-2-zephyr-1_6b specs, VRAM requirements, and which GPUs can run it.
stablelm-3b-4e1t specs, VRAM requirements, and which GPUs can run it.
stablelm-base-alpha-7b-v2 specs, VRAM requirements, and which GPUs can run it.
stablelm-zephyr-3b specs, VRAM requirements, and which GPUs can run it.
starchat-alpha specs, VRAM requirements, and which GPUs can run it.
Starling-LM-7B-beta specs, VRAM requirements, and which GPUs can run it.
steerling-8b specs, VRAM requirements, and which GPUs can run it.
Step-3.5-Flash specs, VRAM requirements, and which GPUs can run it.
stories15M_MOE specs, VRAM requirements, and which GPUs can run it.
Strand-Rust-Coder-14B-v1 specs, VRAM requirements, and which GPUs can run it.
tiny-aya-global specs, VRAM requirements, and which GPUs can run it.
tiny-random-Gemma2ForCausalLM specs, VRAM requirements, and which GPUs can run it.
TinyLlama-1.1B-Chat-v0.3-GPTQ specs, VRAM requirements, and which GPUs can run it.
TinyLlama-1.1B-Chat-v1.0 specs, VRAM requirements, and which GPUs can run it.
tulu-2-dpo-70b specs, VRAM requirements, and which GPUs can run it.
txgemma-2b-predict specs, VRAM requirements, and which GPUs can run it.
vaultgemma-1b specs, VRAM requirements, and which GPUs can run it.
wildguard specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-34B specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-34B-32K specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-34B-Chat specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-34B-Chat-16K specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-6B specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-6B-Chat specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-9B specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-9B-32K specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-9B-Chat specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-9B-Chat-16K specs, VRAM requirements, and which GPUs can run it.
Yi-6B specs, VRAM requirements, and which GPUs can run it.
Yi-6B-200K specs, VRAM requirements, and which GPUs can run it.
Yi-6B-Chat specs, VRAM requirements, and which GPUs can run it.
Yi-9B specs, VRAM requirements, and which GPUs can run it.
Yi-9B-200K specs, VRAM requirements, and which GPUs can run it.
Yi-Coder-9B specs, VRAM requirements, and which GPUs can run it.
Yi-Coder-9B-Chat specs, VRAM requirements, and which GPUs can run it.
zephyr-7b-beta specs, VRAM requirements, and which GPUs can run it.