LLM Models

Compare local LLM models and find which GPU you need to run them. Each entry covers VRAM requirements, quantization options, and hardware compatibility.

Showing 346 of 346 models
| Model | Developer | Params | Context | Min VRAM | Use Cases | Compatible GPUs |
|---|---|---|---|---|---|---|
| AceReason-Nemotron-14B | nvidia | 14.77B | 4K | 9.75GB | text-generation | 108 |
| AI21-Jamba-Large-1.5 | ai21labs | 398.56B | 4K | 263.04GB | text-generation | 0 |
| AI21-Jamba-Mini-1.5 | ai21labs | 51.57B | 4K | 34.03GB | text-generation | 37 |
| AI21-Jamba-Mini-1.6 | ai21labs | 51.57B | 4K | 34.03GB | text-generation | 37 |
| Athene-70B-Preview | Nexusflow | 70.55B | 4K | 46.56GB | text-generation | 36 |
| Athene-V2-Agent | Nexusflow | 72.70B | 4K | 47.98GB | text-generation | 36 |
| bigscience-small-testing | bigscience | 0.02B | 4K | 0.01GB | text-generation | 169 |
| bitnet-b1.58-2B-4T-bf16 | microsoft | 2.41B | 4K | 1.59GB | text-generation | 169 |
| bloom-1b1 | bigscience | 1.07B | 4K | 0.7GB | text-generation | 169 |
| bloom-1b7 | bigscience | 1.72B | 4K | 1.13GB | text-generation | 169 |
| bloom-3b | bigscience | 3.00B | 4K | 1.98GB | text-generation | 169 |
| bloom-560m | bigscience | 0.56B | 4K | 0.37GB | text-generation | 169 |
| bloom-7b1 | bigscience | 7.07B | 4K | 4.66GB | text-generation | 163 |
| bloomz | bigscience | 176.25B | 4K | 116.33GB | text-generation | 5 |
| bloomz-1b7 | bigscience | 1.72B | 4K | 1.13GB | text-generation | 169 |
| bloomz-3b | bigscience | 3.00B | 4K | 1.98GB | text-generation | 169 |
| bloomz-560m | bigscience | 0.56B | 4K | 0.37GB | text-generation | 169 |
| bloomz-7b1 | bigscience | 7.07B | 4K | 4.66GB | text-generation | 163 |
| bloomz-7b1-mt | bigscience | 7.07B | 4K | 4.66GB | text-generation | 163 |
| Bolmo-1B | allenai | 1.47B | 4K | 0.97GB | text-generation | 169 |
| codegemma-2b | google | 2.51B | 4K | 1.65GB | text-generation | 169 |
| CodeLlama-13b-Instruct-hf | meta-llama | 13.02B | 4K | 8.59GB | text-generation | 108 |
| CodeLlama-7b-Instruct-hf | meta-llama | 6.74B | 4K | 4.44GB | text-generation | 163 |
| deep-ignorance-unfiltered | EleutherAI | 6.86B | 4K | 4.52GB | text-generation | 163 |
| deepseek-coder-33b-base | deepseek-ai | 33.34B | 4K | 22.01GB | text-generation | 54 |
| deepseek-coder-33b-instruct | deepseek-ai | 33.34B | 4K | 22.01GB | text-generation | 54 |
| deepseek-coder-6.7b-base | deepseek-ai | 6.74B | 4K | 4.44GB | text-generation | 163 |
| deepseek-coder-6.7b-instruct | deepseek-ai | 6.74B | 4K | 4.44GB | text-generation | 163 |
| deepseek-coder-7b-base-v1.5 | deepseek-ai | 6.91B | 4K | 4.57GB | text-generation | 163 |
| deepseek-coder-7b-instruct-v1.5 | deepseek-ai | 6.91B | 4K | 4.57GB | text-generation | 163 |
| DeepSeek-Coder-V2-Instruct | deepseek-ai | 235.74B | 4K | 155.58GB | text-generation | 1 |
| DeepSeek-Coder-V2-Instruct-0724 | deepseek-ai | 235.74B | 4K | 155.58GB | text-generation | 1 |
| DeepSeek-Coder-V2-Lite-Base | deepseek-ai | 15.71B | 4K | 10.36GB | text-generation | 106 |
| deepseek-moe-16b-base | deepseek-ai | 16.38B | 4K | 10.81GB | text-generation | 106 |
| deepseek-moe-16b-chat | deepseek-ai | 16.38B | 4K | 10.81GB | text-generation | 106 |
| DeepSeek-R1-0528 | deepseek-ai | 684.53B | 4K | 451.79GB | text-generation | 0 |
| DeepSeek-R1-0528-NVFP4 | nvidia | 396.77B | 4K | 261.87GB | text-generation | 0 |
| DeepSeek-R1-0528-NVFP4-v2 | nvidia | 393.63B | 4K | 259.8GB | text-generation | 0 |
| DeepSeek-R1-0528-Qwen3-8B | deepseek-ai | 8.19B | 4K | 5.4GB | text-generation | 163 |
| DeepSeek-R1-0528-Qwen3-8B-MLX-4bit | lmstudio-community | 1.28B | 4K | 0.85GB | text-generation | 169 |
| DeepSeek-R1-0528-Qwen3-8B-MLX-8bit | lmstudio-community | 2.30B | 4K | 1.52GB | text-generation | 169 |
| DeepSeek R1 Distill 14B | DeepSeek | 14.00B | 65K | 9.5GB | reasoning, math, coding, analysis | 108 |
| DeepSeek-R1-Distill-Qwen-14B | deepseek-ai | 14.77B | 4K | 9.75GB | text-generation | 108 |
| DeepSeek-R1-Distill-Qwen-32B | deepseek-ai | 32.76B | 4K | 21.63GB | text-generation | 54 |
| DeepSeek-R1-Distill-Qwen-7B | deepseek-ai | 7.62B | 4K | 5.03GB | text-generation | 163 |
| DeepSeek-R1-NVFP4 | nvidia | 396.77B | 4K | 261.87GB | text-generation | 0 |
| DeepSeek-V2 | deepseek-ai | 235.74B | 4K | 155.58GB | text-generation | 1 |
| DeepSeek-V2.5 | deepseek-ai | 235.74B | 4K | 155.58GB | text-generation | 1 |
| DeepSeek-V2-Chat | deepseek-ai | 235.74B | 4K | 155.58GB | text-generation | 1 |
| DeepSeek-V2-Chat-0628 | deepseek-ai | 235.74B | 4K | 155.58GB | text-generation | 1 |
| DeepSeek-V2-Lite | deepseek-ai | 15.71B | 4K | 10.36GB | text-generation | 106 |
| DeepSeek-V2-Lite-Chat | deepseek-ai | 15.71B | 4K | 10.36GB | text-generation | 106 |
| Deepseek-V2 Pro | DeepSeek AI | 70.00B | 131K | 45.02GB | chat, code, reasoning | 36 |
| DeepSeek-V3-0324 | deepseek-ai | 684.53B | 4K | 451.79GB | text-generation | 0 |
| DeepSeek-V3-0324-NVFP4 | nvidia | 396.77B | 4K | 261.87GB | text-generation | 0 |
| DeepSeek-V3.1-NVFP4 | nvidia | 393.63B | 4K | 259.8GB | text-generation | 0 |
| DeepSeek-V3.2 | DeepSeek AI | 70.00B | 131K | 77.47GB | reasoning, agentic workflows | 15 |
| DeepSeek-V3.2-NVFP4 | nvidia | 394.50B | 4K | 260.37GB | text-generation | 0 |
| DialoGPT-small | microsoft | 0.18B | 4K | 0.12GB | text-generation | 169 |
| distilgpt2 | distilbert | 0.09B | 4K | 0.06GB | text-generation | 169 |
| dolphin-2.9.1-yi-1.5-34b | dphn | 34.39B | 4K | 22.69GB | text-generation | 54 |
| Dolphin-Mistral-24B-Venice-Edition | dphn | 23.57B | 4K | 15.55GB | text-generation | 81 |
| ELM | Joaoffg | 0.90B | 4K | 0.59GB | text-generation | 169 |
| falcon-11B | tiiuae | 11.10B | 4K | 7.33GB | text-generation | 141 |
| falcon-7b-instruct | tiiuae | 7.22B | 4K | 4.76GB | text-generation | 163 |
| Falcon-H1-0.5B-Base | tiiuae | 0.52B | 4K | 0.34GB | text-generation | 169 |
| Falcon-H1-0.5B-Instruct | tiiuae | 0.52B | 4K | 0.34GB | text-generation | 169 |
| Falcon-H1-1.5B-Base | tiiuae | 1.55B | 4K | 1.02GB | text-generation | 169 |
| Falcon-H1-1.5B-Instruct | tiiuae | 1.55B | 4K | 1.02GB | text-generation | 169 |
| Falcon-H1-34B-Base | tiiuae | 33.64B | 4K | 22.21GB | text-generation | 54 |
| Falcon-H1-34B-Instruct | tiiuae | 33.64B | 4K | 22.21GB | text-generation | 54 |
| Falcon-H1-3B-Base | tiiuae | 3.15B | 4K | 2.08GB | text-generation | 169 |
| Falcon-H1-3B-Instruct | tiiuae | 3.15B | 4K | 2.08GB | text-generation | 169 |
| Falcon-H1-7B-Base | tiiuae | 7.59B | 4K | 5GB | text-generation | 163 |
| Falcon-H1-7B-Instruct | tiiuae | 7.59B | 4K | 5GB | text-generation | 163 |
| Falcon-H1-Tiny-90M-Instruct | tiiuae | 0.09B | 4K | 0.06GB | text-generation | 169 |
| falcon-mamba-7b-instruct | tiiuae | 7.27B | 4K | 4.8GB | text-generation | 163 |
| falcon-mamba-tiny-dev | tiiuae | 0.01B | 4K | 0.01GB | text-generation | 169 |
| Falcon3-10B-Base | tiiuae | 10.31B | 4K | 6.8GB | text-generation | 141 |
| Falcon3-1B-Instruct | tiiuae | 1.67B | 4K | 1.1GB | text-generation | 169 |
| Falcon3-3B-Base | tiiuae | 3.23B | 4K | 2.13GB | text-generation | 169 |
| Falcon3-3B-Instruct | tiiuae | 3.23B | 4K | 2.13GB | text-generation | 169 |
| Falcon3-7B-Base | tiiuae | 7.46B | 4K | 4.92GB | text-generation | 163 |
| Falcon3-7B-Instruct | tiiuae | 7.46B | 4K | 4.92GB | text-generation | 163 |
| Flex-reddit-2x7B-1T | allenai | 11.63B | 4K | 7.68GB | text-generation | 141 |
| gemma-1.1-2b-it | google | 2.51B | 4K | 1.65GB | text-generation | 169 |
| gemma-1.1-7b-it | google | 8.54B | 4K | 5.63GB | text-generation | 163 |
| gemma-2-27b-it | google | 27.23B | 4K | 17.97GB | text-generation | 58 |
| gemma-2-9b-it | google | 9.24B | 4K | 6.11GB | text-generation | 141 |
| GLM-4.7-Flash-MLX-6bit | lmstudio-community | 6.56B | 4K | 4.32GB | text-generation | 163 |
| GLM-4.7-Flash-MLX-8bit | lmstudio-community | 8.43B | 4K | 5.57GB | text-generation | 163 |
| gpt-neo-1.3B | EleutherAI | 1.37B | 4K | 0.9GB | text-generation | 169 |
| gpt-neo-125m | EleutherAI | 0.15B | 4K | 0.1GB | text-generation | 169 |
| gpt-neo-2.7B | EleutherAI | 2.72B | 4K | 1.79GB | text-generation | 169 |
| gpt-oss-120b | openai | 120.41B | 4K | 79.48GB | text-generation | 15 |
| gpt-oss-120b-Eagle3-long-context | nvidia | 0.22B | 4K | 0.14GB | text-generation | 169 |
| gpt-oss-20b | openai | 21.51B | 4K | 14.2GB | text-generation | 81 |
| gpt2 | openai-community | 0.14B | 4K | 0.09GB | text-generation | 169 |
| gpt2-large | openai-community | 0.81B | 4K | 0.54GB | text-generation | 169 |
| gpt2-medium | openai-community | 0.38B | 4K | 0.25GB | text-generation | 169 |
| gpt2-mini | erwanf | 0.04B | 4K | 0.02GB | text-generation | 169 |
| h2ovl-mississippi-2b | h2oai | 2.15B | 4K | 1.42GB | text-generation | 169 |
| h2ovl-mississippi-800m | h2oai | 0.83B | 4K | 0.55GB | text-generation | 169 |
| Hermes-2-Pro-Llama-3-8B | NousResearch | 8.03B | 4K | 5.3GB | text-generation | 163 |
| Hermes-2-Pro-Mistral-7B | NousResearch | 7.24B | 4K | 4.79GB | text-generation | 163 |
| Hermes-2-Theta-Llama-3-8B | NousResearch | 8.03B | 4K | 5.3GB | text-generation | 163 |
| Hermes-3-Llama-3.1-8B | NousResearch | 8.03B | 4K | 5.3GB | text-generation | 163 |
| Hermes-4-14B | NousResearch | 14.77B | 4K | 9.75GB | text-generation | 108 |
| internlm2_5-7b | internlm | 7.74B | 4K | 5.1GB | text-generation | 163 |
| internlm2-chat-1_8b | internlm | 1.89B | 4K | 1.24GB | text-generation | 169 |
| internlm2-chat-20b | internlm | 19.86B | 4K | 13.11GB | text-generation | 81 |
| internlm2-chat-7b-sft | internlm | 7.74B | 4K | 5.1GB | text-generation | 163 |
| Jan-v3-4B-base-instruct | janhq | 4.41B | 4K | 2.92GB | text-generation | 169 |
| japanese-gpt-neox-small | rinna | 0.20B | 4K | 0.13GB | text-generation | 169 |
| LFM2-24B-A2B | LiquidAI | 23.84B | 4K | 15.74GB | text-generation | 81 |
| LFM2.5-1.2B-Instruct | LiquidAI | 1.17B | 4K | 0.77GB | text-generation | 169 |
| LFM2.5-1.2B-Instruct-MLX-4bit | lmstudio-community | 0.18B | 4K | 0.12GB | text-generation | 169 |
| LFM2.5-1.2B-Instruct-MLX-6bit | lmstudio-community | 0.26B | 4K | 0.17GB | text-generation | 169 |
| LFM2.5-1.2B-Instruct-MLX-8bit | lmstudio-community | 0.33B | 4K | 0.22GB | text-generation | 169 |
| LFM2-8B-A1B | LiquidAI | 8.34B | 4K | 5.5GB | text-generation | 163 |
| Llama-2-7b-hf | meta-llama | 6.74B | 4K | 4.44GB | text-generation | 163 |
| Llama-3.1-405B-Instruct | meta-llama | 405.85B | 4K | 267.86GB | text-generation | 0 |
| Llama-3.1-405B-Instruct-FP8 | meta-llama | 405.87B | 4K | 267.87GB | text-generation | 0 |
| Llama 3.1 70B | Meta | 70.00B | 131K | 44GB | chat, coding, reasoning | 36 |
| Llama-3.1-70B-Instruct | meta-llama | 70.55B | 4K | 46.56GB | text-generation | 36 |
| Llama 3.1 8B | Meta | 8.00B | 131K | 5.5GB | chat, coding, summarization | 163 |
| Llama-3.1-8B-Instruct-FP8 | nvidia | 8.03B | 4K | 5.3GB | text-generation | 163 |
| Llama-3.1-Tulu-3-8B-SFT | allenai | 8.03B | 4K | 5.3GB | text-generation | 163 |
| Llama-3.2-1B | meta-llama | 1.24B | 4K | 0.81GB | text-generation | 169 |
| Llama-3.2-1B-Instruct-FP8 | RedHatAI | 1.50B | 4K | 0.99GB | text-generation | 169 |
| Llama-3.2-1B-Instruct-FP8-dynamic | RedHatAI | 1.50B | 4K | 0.99GB | text-generation | 169 |
| Llama-3.2-3B | meta-llama | 3.21B | 4K | 2.12GB | text-generation | 169 |
| llama-3.3-70b-instruct-awq | casperhansen | 70.55B | 4K | 46.56GB | text-generation | 36 |
| Llama-3_3-Nemotron-Super-49B-v1 | nvidia | 49.87B | 4K | 32.91GB | text-generation | 37 |
| Llama-3_3-Nemotron-Super-49B-v1_5-FP8 | nvidia | 49.87B | 4K | 32.91GB | text-generation | 37 |
| Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 | nvidia | 28.97B | 4K | 19.12GB | text-generation | 58 |
| Llama-3_3-Nemotron-Super-49B-v1-FP8 | nvidia | 49.87B | 4K | 32.91GB | text-generation | 37 |
| llama-300M-v3-original | deqing | 0.32B | 4K | 0.21GB | text-generation | 169 |
| Llama-Guard-3-8B | meta-llama | 8.03B | 4K | 5.3GB | text-generation | 163 |
| Llama-Guard-3-8B-INT8 | meta-llama | 8.03B | 4K | 5.3GB | text-generation | 163 |
| LlamaGuard-7b | meta-llama | 6.74B | 4K | 4.44GB | text-generation | 163 |
| llm-jp-3-3.7b-instruct | llm-jp | 3.78B | 4K | 2.5GB | text-generation | 169 |
| LocoOperator-4B | LocoreMind | 4.02B | 4K | 2.65GB | text-generation | 169 |
| maira-2 | microsoft | 6.88B | 4K | 4.54GB | text-generation | 163 |
| MediPhi-Clinical | microsoft | 3.82B | 4K | 2.52GB | text-generation | 169 |
| MediPhi-Instruct | microsoft | 3.82B | 4K | 2.52GB | text-generation | 169 |
| Meta-Llama-3.1-70B-Instruct | NousResearch | 70.55B | 4K | 46.56GB | text-generation | 36 |
| Meta-Llama-3.1-8B | NousResearch | 8.03B | 4K | 5.3GB | text-generation | 163 |
| Meta-Llama-3.1-8B-Instruct | unsloth | 8.03B | 4K | 5.3GB | text-generation | 163 |
| Meta-Llama-3.1-8B-Instruct-bnb-4bit | unsloth | 8.25B | 4K | 5.45GB | text-generation | 163 |
| Meta-Llama-3.1-8B-Instruct-FP8 | RedHatAI | 8.03B | 4K | 5.3GB | text-generation | 163 |
| Meta-Llama-3-70B-Instruct | meta-llama | 70.55B | 4K | 46.56GB | text-generation | 36 |
| Meta-Llama-3-8B | meta-llama | 8.03B | 4K | 5.3GB | text-generation | 163 |
| Meta-Llama-3-8B-Instruct | meta-llama | 8.03B | 4K | 5.3GB | text-generation | 163 |
| Meta-Llama-Guard-2-8B | meta-llama | 8.03B | 4K | 5.3GB | text-generation | 163 |
| MiniMax-M2.5 | MiniMaxAI | 228.70B | 4K | 150.94GB | text-generation | 1 |
| MiniMax-M2-AWQ | QuantTrio | 228.69B | 4K | 150.93GB | text-generation | 1 |
| Mistral 7B | Mistral | 7.00B | 32K | 5GB | chat, instruction-following, translation | 163 |
| Mistral-7B-Instruct-v0.2 | mistralai | 7.24B | 4K | 4.79GB | text-generation | 163 |
| mistral-7b-v0.3-bnb-4bit | unsloth | 7.47B | 4K | 4.93GB | text-generation | 163 |
| Mistral-NeMo-Minitron-8B-Instruct | nvidia | 8.41B | 4K | 5.56GB | text-generation | 163 |
| Mistral-Small-24B-Instruct-2501-AWQ | stelterlab | 23.57B | 4K | 15.55GB | text-generation | 81 |
| Mixtral-8x7B-Instruct-v0.1-GPTQ | TheBloke | 46.71B | 4K | 30.83GB | text-generation | 40 |
| Nanbeige4.1-3B | Nanbeige | 3.93B | 4K | 2.6GB | text-generation | 169 |
| Nanbeige4.1-3B-heretic | heretic-org | 3.93B | 4K | 2.6GB | text-generation | 169 |
| Nemotron-Flash-3B | nvidia | 2.75B | 4K | 1.81GB | text-generation | 169 |
| Nemotron-H-4B-Base-8K | nvidia | 4.49B | 4K | 2.96GB | text-generation | 169 |
| Nemotron-H-4B-Instruct-128K | nvidia | 4.49B | 4K | 2.96GB | text-generation | 169 |
| Nous-Hermes-2-Mistral-7B-DPO | NousResearch | 7.24B | 4K | 4.79GB | text-generation | 163 |
| Nous-Hermes-2-Mixtral-8x7B-DPO | NousResearch | 46.70B | 4K | 30.82GB | text-generation | 40 |
| Nous-Hermes-2-SOLAR-10.7B | NousResearch | 10.73B | 4K | 7.08GB | text-generation | 141 |
| Nous-Hermes-llama-2-7b | NousResearch | 6.74B | 4K | 4.44GB | text-generation | 163 |
| NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 | nvidia | 31.58B | 4K | 20.85GB | text-generation | 54 |
| NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 | nvidia | 31.58B | 4K | 20.85GB | text-generation | 54 |
| NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 | nvidia | 31.58B | 4K | 20.85GB | text-generation | 54 |
| NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 | nvidia | 18.24B | 4K | 12.03GB | text-generation | 81 |
| NVIDIA-Nemotron-Nano-9B-v2 | nvidia | 8.89B | 4K | 5.86GB | text-generation | 163 |
| NVIDIA-Nemotron-Nano-9B-v2-Base | nvidia | 8.89B | 4K | 5.86GB | text-generation | 163 |
| NVIDIA-Nemotron-Nano-9B-v2-FP8 | nvidia | 8.89B | 4K | 5.86GB | text-generation | 163 |
| NVIDIA-Nemotron-Nano-9B-v2-Japanese | nvidia | 8.89B | 4K | 5.86GB | text-generation | 163 |
| OLMo-1B | allenai | 1.18B | 4K | 0.78GB | text-generation | 169 |
| OLMo-1B-0724-hf | allenai | 1.28B | 4K | 0.85GB | text-generation | 169 |
| OLMo-1B-hf | allenai | 1.18B | 4K | 0.78GB | text-generation | 169 |
| OLMo-2-0325-32B | allenai | 32.23B | 4K | 21.27GB | text-generation | 54 |
| OLMo-2-0325-32B-Instruct | allenai | 32.23B | 4K | 21.27GB | text-generation | 54 |
| OLMo-2-0425-1B | allenai | 1.48B | 4K | 0.98GB | text-generation | 169 |
| OLMo-2-0425-1B-Instruct | allenai | 1.48B | 4K | 0.98GB | text-generation | 169 |
| OLMo-2-0425-1B-RLVR1 | allenai | 1.48B | 4K | 0.98GB | text-generation | 169 |
| OLMo-2-1124-13B-Instruct | allenai | 13.72B | 4K | 9.05GB | text-generation | 108 |
| OLMo-2-1124-7B-Instruct | allenai | 7.30B | 4K | 4.82GB | text-generation | 163 |
| Olmo-3.1-32B-Think | allenai | 32.23B | 4K | 21.27GB | text-generation | 54 |
| Olmo-3.1-7B-RL-Zero-Math | allenai | 7.30B | 4K | 4.82GB | text-generation | 163 |
| Olmo-3-1025-7B | allenai | 7.30B | 4K | 4.82GB | text-generation | 163 |
| Olmo-3-1125-32B | allenai | 32.23B | 4K | 21.27GB | text-generation | 54 |
| Olmo-3-32B-Think | allenai | 32.23B | 4K | 21.27GB | text-generation | 54 |
| Olmo-3-7B-Instruct | allenai | 7.30B | 4K | 4.82GB | text-generation | 163 |
| Olmo-3-7B-Instruct-DPO | allenai | 7.30B | 4K | 4.82GB | text-generation | 163 |
| Olmo-3-7B-Instruct-SFT | allenai | 7.30B | 4K | 4.82GB | text-generation | 163 |
| Olmo-3-7B-Think | allenai | 7.30B | 4K | 4.82GB | text-generation | 163 |
| Olmo-3-7B-Think-DPO | allenai | 7.30B | 4K | 4.82GB | text-generation | 163 |
| Olmo-3-7B-Think-SFT | allenai | 7.30B | 4K | 4.82GB | text-generation | 163 |
| OLMo-7B-0724-hf | allenai | 6.89B | 4K | 4.54GB | text-generation | 163 |
| OLMo-7B-hf | allenai | 6.89B | 4K | 4.54GB | text-generation | 163 |
| Olmo-Hybrid-Instruct-DPO-7B | allenai | 7.43B | 4K | 4.91GB | text-generation | 163 |
| OLMoE-1B-7B-0125 | allenai | 6.92B | 4K | 4.57GB | text-generation | 163 |
| OLMoE-1B-7B-0125-Instruct | allenai | 6.92B | 4K | 4.57GB | text-generation | 163 |
| OLMoE-1B-7B-0924-Instruct | allenai | 6.92B | 4K | 4.57GB | text-generation | 163 |
| phi-1 | microsoft | 1.42B | 4K | 0.94GB | text-generation | 169 |
| phi-1_5 | microsoft | 1.42B | 4K | 0.94GB | text-generation | 169 |
| phi-2 | microsoft | 2.78B | 4K | 1.84GB | text-generation | 169 |
| Phi-3-medium-4k-instruct | microsoft | 13.96B | 4K | 9.22GB | text-generation | 108 |
| Phi-3-mini-4k-instruct-gptq-4bit | kaitchup | 3.82B | 4K | 2.52GB | text-generation | 169 |
| Phi-3-small-8k-instruct | microsoft | 7.39B | 4K | 4.88GB | text-generation | 163 |
| Phi-mini-MoE-instruct | microsoft | 7.65B | 4K | 5.05GB | text-generation | 163 |
| Phi-tiny-MoE-instruct | microsoft | 3.76B | 4K | 2.48GB | text-generation | 169 |
| polyglot-ko-1.3b | EleutherAI | 1.43B | 4K | 0.95GB | text-generation | 169 |
| polyglot-ko-12.8b | EleutherAI | 13.06B | 4K | 8.62GB | text-generation | 108 |
| polyglot-ko-5.8b | EleutherAI | 6.00B | 4K | 3.96GB | text-generation | 169 |
| pythia-1.4b | EleutherAI | 1.52B | 4K | 1GB | text-generation | 169 |
| pythia-1.4b-deduped | EleutherAI | 1.41B | 4K | 0.94GB | text-generation | 169 |
| pythia-12b | EleutherAI | 12.00B | 4K | 7.92GB | text-generation | 141 |
| pythia-14m | EleutherAI | 0.01B | 4K | 0.01GB | text-generation | 169 |
| pythia-14m-deduped | EleutherAI | 0.04B | 4K | 0.02GB | text-generation | 169 |
| pythia-160m-deduped | EleutherAI | 0.21B | 4K | 0.14GB | text-generation | 169 |
| pythia-160m-seed1 | EleutherAI | 0.21B | 4K | 0.14GB | text-generation | 169 |
| pythia-1b | EleutherAI | 1.08B | 4K | 0.72GB | text-generation | 169 |
| pythia-2.8b-deduped | EleutherAI | 2.91B | 4K | 1.93GB | text-generation | 169 |
| pythia-31m | EleutherAI | 0.03B | 4K | 0.02GB | text-generation | 169 |
| pythia-31m-deduped | EleutherAI | 0.06B | 4K | 0.03GB | text-generation | 169 |
| pythia-410m | EleutherAI | 0.51B | 4K | 0.33GB | text-generation | 169 |
| pythia-410m-deduped | EleutherAI | 0.51B | 4K | 0.33GB | text-generation | 169 |
| pythia-410m-v0 | EleutherAI | 0.51B | 4K | 0.33GB | text-generation | 169 |
| pythia-6.9b | EleutherAI | 6.99B | 4K | 4.61GB | text-generation | 163 |
| pythia-70m-deduped | EleutherAI | 0.10B | 4K | 0.07GB | text-generation | 169 |
| Qwen 2.5 72B | Alibaba | 72.00B | 131K | 45.02GB | chat, code, reasoning | 36 |
| Qwen 2.5 72B Instruct | Qwen | 72.00B | 131K | 45.02GB | chat, code, reasoning | 36 |
| Qwen1.5-110B-Chat-AWQ | Qwen | 111.21B | 4K | 73.4GB | text-generation | 15 |
| Qwen2-0.5B-Instruct | Qwen | 0.49B | 4K | 0.33GB | text-generation | 169 |
| Qwen2-1.5B-Instruct | Qwen | 1.54B | 4K | 1.02GB | text-generation | 169 |
| Qwen2.5-0.5B | Qwen | 0.49B | 4K | 0.33GB | text-generation | 169 |
| Qwen2.5-0.5B-Instruct | Qwen | 0.49B | 4K | 0.33GB | text-generation | 169 |
| Qwen2.5-1.5B | Qwen | 1.54B | 4K | 1.02GB | text-generation | 169 |
| Qwen2.5-1.5B-Instruct | Qwen | 1.54B | 4K | 1.02GB | text-generation | 169 |
| Qwen2.5-1.5B-Instruct-AWQ | Qwen | 1.78B | 4K | 1.18GB | text-generation | 169 |
| Qwen2.5-1.5B-quantized.w8a8 | RedHatAI | 1.78B | 4K | 1.18GB | text-generation | 169 |
| Qwen2.5-14B-Instruct-AWQ | Qwen | 14.77B | 4K | 9.75GB | text-generation | 108 |
| Qwen2.5-32B | Qwen | 32.76B | 4K | 21.63GB | text-generation | 54 |
| Qwen2.5-32B-Instruct-AWQ | Qwen | 32.76B | 4K | 21.63GB | text-generation | 54 |
| Qwen2.5-3B | Qwen | 3.09B | 4K | 2.04GB | text-generation | 169 |
| Qwen2.5-3B-Instruct | Qwen | 3.09B | 4K | 2.04GB | text-generation | 169 |
| Qwen2.5-72B-Instruct | Qwen | 72.71B | 4K | 47.98GB | text-generation | 36 |
| Qwen2.5-72B-Instruct-AWQ | Qwen | 72.96B | 4K | 48.15GB | text-generation | 16 |
| Qwen2.5-7B | Qwen | 7.62B | 4K | 5.03GB | text-generation | 163 |
| Qwen2.5-7B-Instruct | Qwen | 7.62B | 4K | 5.03GB | text-generation | 163 |
| Qwen2.5-Coder-0.5B-Instruct | Qwen | 0.49B | 4K | 0.33GB | text-generation | 169 |
| Qwen2.5-Coder-1.5B-Instruct | Qwen | 1.54B | 4K | 1.02GB | text-generation | 169 |
| Qwen2.5-Coder-14B-Instruct | Qwen | 14.77B | 4K | 9.75GB | text-generation | 108 |
| Qwen2.5-Coder-32B-Instruct | Qwen | 32.76B | 4K | 21.63GB | text-generation | 54 |
| Qwen2.5-Coder-32B-Instruct-AWQ | Qwen | 32.76B | 4K | 21.63GB | text-generation | 54 |
| Qwen2.5-Coder-7B-Instruct | Qwen | 7.62B | 4K | 5.03GB | text-generation | 163 |
| Qwen2.5-Coder-7B-Instruct-AWQ | Qwen | 7.62B | 4K | 5.03GB | text-generation | 163 |
| Qwen2.5-Coder-7B-Instruct-GPTQ-Int4 | Qwen | 7.62B | 4K | 5.03GB | text-generation | 163 |
| Qwen2.5-Math-1.5B | Qwen | 1.54B | 4K | 1.02GB | text-generation | 169 |
| Qwen2.5-VL-7B-Instruct-NVFP4 | nvidia | 5.44B | 4K | 3.59GB | text-generation | 169 |
| Qwen2 72B | Qwen | 72.00B | 65K | 45.02GB | chat, code, reasoning | 36 |
| Qwen2-7B-Instruct | Qwen | 7.62B | 4K | 5.03GB | text-generation | 163 |
| Qwen3-0.6B | Qwen | 0.75B | 4K | 0.5GB | text-generation | 169 |
| Qwen3-0.6B-FP8 | Qwen | 0.75B | 4K | 0.5GB | text-generation | 169 |
| Qwen3-1.7B-Base | Qwen | 1.72B | 4K | 1.13GB | text-generation | 169 |
| Qwen3-14B-Instruct | OpenPipe | 14.77B | 4K | 9.75GB | text-generation | 108 |
| Qwen3-14B-NVFP4 | nvidia | 8.99B | 4K | 5.93GB | text-generation | 163 |
| Qwen3-235B-A22B | Qwen | 235.09B | 4K | 155.17GB | text-generation | 1 |
| Qwen3-235B-A22B-Instruct-2507-FP8 | Qwen | 235.11B | 4K | 155.17GB | text-generation | 1 |
| Qwen3-235B-A22B-NVFP4 | nvidia | 132.81B | 4K | 87.65GB | text-generation | 8 |
| Qwen3-30B-A3B-Instruct-2507-FP8 | Qwen | 30.53B | 4K | 20.15GB | text-generation | 54 |
| Qwen3-30B-A3B-NVFP4 | nvidia | 17.45B | 4K | 11.52GB | text-generation | 100 |
| Qwen3-32B-AWQ | Qwen | 32.76B | 4K | 21.63GB | text-generation | 54 |
| Qwen3-32B-NVFP4 | nvidia | 19.11B | 4K | 12.62GB | text-generation | 81 |
| Qwen3-4B-AWQ | Qwen | 4.02B | 4K | 2.65GB | text-generation | 169 |
| Qwen3-4B-Instruct-2507-FP8 | Qwen | 4.41B | 4K | 2.92GB | text-generation | 169 |
| Qwen3-4B-SafeRL | Qwen | 4.02B | 4K | 2.65GB | text-generation | 169 |
| Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled | Jackrong | 27.78B | 4K | 18.34GB | text-generation | 58 |
| Qwen3.5-27B-Text-NVFP4-MTP | osoleve | 16.67B | 4K | 11GB | text-generation | 106 |
| Qwen3.5-4B-Safety-Thinking | MerlinSafety | 4.21B | 4K | 2.77GB | text-generation | 169 |
| Qwen3.5-9B-abliterated | lukey03 | 8.95B | 4K | 5.91GB | text-generation | 163 |
| Qwen3-8B-AWQ | Qwen | 8.19B | 4K | 5.4GB | text-generation | 163 |
| Qwen3-8B-Base | Qwen | 8.19B | 4K | 5.4GB | text-generation | 163 |
| Qwen3-8B-FP8 | nvidia | 8.19B | 4K | 5.4GB | text-generation | 163 |
| Qwen3-8B-NVFP4 | nvidia | 5.15B | 4K | 3.4GB | text-generation | 169 |
| Qwen3-Coder-30B-A3B-Instruct-FP8 | Qwen | 30.53B | 4K | 20.15GB | text-generation | 54 |
| Qwen3-Coder-Next | Qwen | 79.67B | 4K | 52.58GB | text-generation | 16 |
| Qwen3-Coder-Next-8bit | NexVeridian | 22.41B | 4K | 14.79GB | text-generation | 81 |
| Qwen3-Coder-Next-AWQ-4bit | bullpoint | 14.44B | 4K | 9.54GB | text-generation | 108 |
| Qwen3-Coder-Next-Base | Qwen | 79.67B | 4K | 52.58GB | text-generation | 16 |
| Qwen3-Coder-Next-FP8 | Qwen | 79.68B | 4K | 52.59GB | text-generation | 16 |
| Qwen3-Next-80B-A3B-Instruct | Qwen | 81.32B | 4K | 53.67GB | text-generation | 16 |
| Qwen3-Next-80B-A3B-Instruct-FP8 | Qwen | 81.33B | 4K | 53.68GB | text-generation | 16 |
| Qwen3-VL-30B-A3B-Instruct-AWQ | QuantTrio | 31.07B | 4K | 20.5GB | text-generation | 54 |
| Qwen3Guard-Gen-0.6B | Qwen | 0.75B | 4K | 0.5GB | text-generation | 169 |
| Qwen3Guard-Gen-4B | Qwen | 4.41B | 4K | 2.92GB | text-generation | 169 |
| Qwen3Guard-Gen-8B | Qwen | 8.19B | 4K | 5.4GB | text-generation | 163 |
| QwQ-32B-AWQ | Qwen | 32.76B | 4K | 21.63GB | text-generation | 54 |
| recurrentgemma-2b | google | 2.68B | 4K | 1.77GB | text-generation | 169 |
| saiga_llama3_8b | IlyaGusev | 8.03B | 4K | 5.3GB | text-generation | 163 |
| SmolLM-135M-Instruct | HuggingFaceTB | 0.13B | 4K | 0.09GB | text-generation | 169 |
| SmolLM2-135M | HuggingFaceTB | 0.13B | 4K | 0.09GB | text-generation | 169 |
| SmolLM2-135M-Instruct | HuggingFaceTB | 0.13B | 4K | 0.09GB | text-generation | 169 |
| SOLAR-10.7B-v1.0 | upstage | 10.73B | 4K | 7.08GB | text-generation | 141 |
| StableBeluga-13B | stabilityai | 13.02B | 4K | 8.59GB | text-generation | 108 |
| stablelm-2-1_6b | stabilityai | 1.64B | 4K | 1.09GB | text-generation | 169 |
| stablelm-2-zephyr-1_6b | stabilityai | 1.64B | 4K | 1.09GB | text-generation | 169 |
| stablelm-3b-4e1t | stabilityai | 2.80B | 4K | 1.85GB | text-generation | 169 |
| stablelm-base-alpha-7b-v2 | stabilityai | 6.89B | 4K | 4.54GB | text-generation | 163 |
| stablelm-zephyr-3b | stabilityai | 2.80B | 4K | 1.85GB | text-generation | 169 |
| starchat-alpha | HuggingFaceH4 | 15.52B | 4K | 10.24GB | text-generation | 106 |
| Starling-LM-7B-beta | Nexusflow | 7.24B | 4K | 4.79GB | text-generation | 163 |
| steerling-8b | guidelabs | 8.39B | 4K | 5.54GB | text-generation | 163 |
| Step-3.5-Flash | stepfun-ai | 199.38B | 4K | 131.59GB | text-generation | 3 |
| stories15M_MOE | ggml-org | 0.04B | 4K | 0.02GB | text-generation | 169 |
| Strand-Rust-Coder-14B-v1 | Fortytwo-Network | 14.77B | 4K | 9.75GB | text-generation | 108 |
| tiny-aya-global | CohereLabs | 3.35B | 4K | 2.21GB | text-generation | 169 |
| tiny-random-Gemma2ForCausalLM | hmellor | 0.01B | 4K | 0.01GB | text-generation | 169 |
| TinyLlama-1.1B-Chat-v0.3-GPTQ | TheBloke | 1.10B | 4K | 0.73GB | text-generation | 169 |
| TinyLlama-1.1B-Chat-v1.0 | TinyLlama | 1.10B | 4K | 0.73GB | text-generation | 169 |
| tulu-2-dpo-70b | allenai | 68.98B | 4K | 45.53GB | text-generation | 36 |
| txgemma-2b-predict | google | 2.61B | 4K | 1.73GB | text-generation | 169 |
| vaultgemma-1b | google | 1.04B | 4K | 0.68GB | text-generation | 169 |
| wildguard | allenai | 7.25B | 4K | 4.79GB | text-generation | 163 |
| Yi-1.5-34B | 01-ai | 34.39B | 4K | 22.69GB | text-generation | 54 |
| Yi-1.5-34B-32K | 01-ai | 34.39B | 4K | 22.69GB | text-generation | 54 |
| Yi-1.5-34B-Chat | 01-ai | 34.39B | 4K | 22.69GB | text-generation | 54 |
| Yi-1.5-34B-Chat-16K | 01-ai | 34.39B | 4K | 22.69GB | text-generation | 54 |
| Yi-1.5-6B | 01-ai | 6.06B | 4K | 4GB | text-generation | 169 |
| Yi-1.5-6B-Chat | 01-ai | 6.06B | 4K | 4GB | text-generation | 169 |
| Yi-1.5-9B | 01-ai | 8.83B | 4K | 5.83GB | text-generation | 163 |
| Yi-1.5-9B-32K | 01-ai | 8.83B | 4K | 5.83GB | text-generation | 163 |
| Yi-1.5-9B-Chat | 01-ai | 8.83B | 4K | 5.83GB | text-generation | 163 |
| Yi-1.5-9B-Chat-16K | 01-ai | 8.83B | 4K | 5.83GB | text-generation | 163 |
| Yi-6B | 01-ai | 6.06B | 4K | 4GB | text-generation | 169 |
| Yi-6B-200K | 01-ai | 6.06B | 4K | 4GB | text-generation | 169 |
| Yi-6B-Chat | 01-ai | 6.06B | 4K | 4GB | text-generation | 169 |
| Yi-9B | 01-ai | 8.83B | 4K | 5.83GB | text-generation | 163 |
| Yi-9B-200K | 01-ai | 8.83B | 4K | 5.83GB | text-generation | 163 |
| Yi-Coder-9B | 01-ai | 8.83B | 4K | 5.83GB | text-generation | 163 |
| Yi-Coder-9B-Chat | 01-ai | 8.83B | 4K | 5.83GB | text-generation | 163 |
| zephyr-7b-beta | HuggingFaceH4 | 7.24B | 4K | 4.79GB | text-generation | 163 |
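
For most rows above, the Min VRAM figure works out to roughly 0.66 GB per billion parameters (for example, 14.77B and 9.75GB for AceReason-Nemotron-14B). The page does not say how the figure is derived, so the ratio below is inferred from the listed numbers and is only an assumption. Entries with named use cases, such as Llama 3.1 70B at 44GB, do not follow it exactly. A minimal sketch of the relationship, with hypothetical helper names `estimate_min_vram_gb` and `fits_on_gpu`:

```python
# Ratio inferred from rows of the listing; an assumption, not a documented formula.
GB_PER_BILLION_PARAMS = 0.66


def estimate_min_vram_gb(params_billion: float) -> float:
    """Approximate the Min VRAM column from a parameter count in billions."""
    return round(params_billion * GB_PER_BILLION_PARAMS, 2)


def fits_on_gpu(params_billion: float, gpu_vram_gb: float) -> bool:
    """Return True when the estimated footprint fits in the given VRAM."""
    return estimate_min_vram_gb(params_billion) <= gpu_vram_gb


# (model, params in billions, listed Min VRAM in GB), copied from the listing.
SAMPLE_ROWS = [
    ("AceReason-Nemotron-14B", 14.77, 9.75),
    ("bloomz", 176.25, 116.33),
    ("DeepSeek-R1-0528", 684.53, 451.79),
    ("Qwen3-8B-AWQ", 8.19, 5.4),
]

# Print the listed figure next to the estimate for each sample row.
for name, params, listed in SAMPLE_ROWS:
    estimate = estimate_min_vram_gb(params)
    print(f"{name}: listed {listed} GB, estimated {estimate} GB")
```

Because the Params column is itself rounded, the estimate can differ from the listed value by about 0.01 GB. As a usage example, `fits_on_gpu(32.76, 24.0)` checks whether a 32.76B model's estimated footprint fits on a card with 24 GB of VRAM.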