๐Ÿค– AI Tools
ยท 3 min read

How to Run Jais 2 Locally โ€” Arabic AI Model Setup Guide


Some links in this article are affiliate links. We earn a commission at no extra cost to you when you purchase through them. Full disclosure.

Jais 2 is the worldโ€™s best Arabic LLM. The 8B version runs on consumer hardware, giving you native Arabic AI for free. Hereโ€™s the setup.

Available models

ModelSizeRAM neededBest for
Jais 2 8B Chat~5 GB8 GBArabic chat, trained on 126B Arabic tokens
Jais 2 70B Chat~45 GB48 GB+Best Arabic quality, GGUF available
Jais 13B (v1)~8 GB12 GBLegacy, still solid

Setup with Ollama

# Install Ollama
brew install ollama

# Pull Jais (check Ollama library for latest)
ollama pull jais

# Or use GGUF from HuggingFace
# Download from huggingface.co/inceptionai/Jais-2-8B-Chat-GGUF

Setup with HuggingFace

For the full 70B model or more control:

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "inceptionai/Jais-2-8B-Chat"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

# Chat in Arabic
prompt = "ุงุดุฑุญ ู„ูŠ ูƒูŠู ูŠุนู…ู„ ุงู„ุฐูƒุงุก ุงู„ุงุตุทู†ุงุนูŠ ุจุทุฑูŠู‚ุฉ ุจุณูŠุทุฉ"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=500)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Setup with llama.cpp (GGUF)

Official GGUF quantizations are available from inceptionai/Jais-2-8B-Chat-GGUF on HuggingFace:

# Download GGUF directly
huggingface-cli download inceptionai/Jais-2-8B-Chat-GGUF --local-dir ./jais-gguf

# Run with llama.cpp
./llama-cli -m ./jais-gguf/jais-2-8b-chat-q5_k_m.gguf -p "ุงุดุฑุญ ุงู„ุฐูƒุงุก ุงู„ุงุตุทู†ุงุนูŠ" -n 500

Available quantizations:

QuantizationSizeRAM neededArabic quality
Q8_0~8.5 GB10 GBBest
Q5_K_M~5.5 GB8 GBVery good (recommended)
Q4_K_M~4.5 GB6 GBGood
Q3_K_M~3.5 GB5 GBNoticeable loss on dialects

Recommendation: Q5_K_M is the sweet spot. Arabic text is more sensitive to quantization than English because of the complex morphology โ€” going below Q4 causes noticeable quality drops on dialectal Arabic.

Hardware requirements

HardwareJais 2 8BJais 2 70B
MacBook Air M2 8GBโœ… ~15 tok/sโŒ
MacBook Pro M3 16GBโœ… ~25 tok/sโŒ
Mac Mini M4 Pro 48GBโœ… ~30 tok/sโœ… ~8 tok/s
RTX 4090 24GBโœ… ~35 tok/sโŒ (VRAM)
A100 80GBโœ… ~50 tok/sโœ… ~20 tok/s

The 8B model is the practical choice for most developers. The 70B model needs serious hardware or a cloud GPU (RunPod, Vultr).

Use cases for developers

Arabic customer support bot

system_prompt = """ุฃู†ุช ู…ุณุงุนุฏ ุฎุฏู…ุฉ ุนู…ู„ุงุก ูˆุฏูˆุฏ. ุฃุฌุจ ุนู„ู‰ ุฃุณุฆู„ุฉ ุงู„ุนู…ู„ุงุก ุจุงู„ู„ุบุฉ ุงู„ุนุฑุจูŠุฉ ุจุทุฑูŠู‚ุฉ ู…ู‡ู†ูŠุฉ ูˆู…ููŠุฏุฉ."""

response = chat(system_prompt, user_message)

Arabic code documentation

prompt = "ุงูƒุชุจ ุชูˆุซูŠู‚ุงู‹ ุจุงู„ุนุฑุจูŠุฉ ู„ู‡ุฐู‡ ุงู„ุฏุงู„ุฉ:\n\ndef calculate_tax(income, rate):\n    return income * rate"

Arabic-English translation with context

Jais understands code-switching (mixing Arabic and English), which is common in tech contexts across the Middle East.

Jais vs general-purpose models for Arabic

ModelArabic qualityArabic dialectsEnglishRun locally
Jais 2 8Bโœ… Bestโœ… MSA + dialectsGoodโœ… 8GB
Jais 2 70Bโœ… Bestโœ… MSA + dialectsGood48GB+
GPT-5GoodBasic MSA onlyโœ… BestโŒ API only
Qwen 3.5DecentBasicโœ… Excellentโœ… 20GB
Llama 4DecentBasicโœ… Excellentโœ… varies

General-purpose models handle Modern Standard Arabic (MSA) reasonably well, but they struggle with dialects (Egyptian, Gulf, Levantine, Moroccan). Jais was specifically trained on dialectal Arabic data.

Jais + Falcon: the UAE AI stack

For developers building Arabic applications, the optimal stack is:

  • Jais 2 for Arabic language tasks (chat, content, translation)
  • Falcon H1R for reasoning and coding tasks
  • Falcon 2 for multilingual tasks

All open source, all from the UAE ecosystem. See our Falcon vs Jais comparison for detailed analysis.

Related: What is Jais? ยท Falcon vs Jais ยท What is Falcon? ยท Ollama Complete Guide ยท Best Open Source Coding Models ยท Self-Hosted AI for Enterprise