• About Us
  • Advertising
  • Digital Magazine
  • Supplements
  • Media Pack
  • Privacy Policy
  • Contact us
CXO Insight Middle East
  • News
  • Opinion
  • Business
    • Industries
      • Transport
      • Retail
      • Government
      • Real Estate
      • Education
      • Energy
      • Banking and Finance
    • Channel
  • Future
    • Tech
    • Gadgets
    • Science
    • Space
    • Sustainability
  • Events
    • Channel Awards
      • 2025
      • 2024
      • 2023
    • Channel Insights Summit 2025
    • Insight Innovation Summit
    • CXO50 Oman
    • CXO50
    • ICT Awards
      • Dubai 2025
      • Saudi Arabia
    • Cyber Strategists Summit
    • Cloud Connect 2025
    • All events
  • Digital Magazine
  • GITEX GLOBAL
No Result
View All Result
CXO Insight Middle East
  • News
  • Opinion
  • Business
    • Industries
      • Transport
      • Retail
      • Government
      • Real Estate
      • Education
      • Energy
      • Banking and Finance
    • Channel
  • Future
    • Tech
    • Gadgets
    • Science
    • Space
    • Sustainability
  • Events
    • Channel Awards
      • 2025
      • 2024
      • 2023
    • Channel Insights Summit 2025
    • Insight Innovation Summit
    • CXO50 Oman
    • CXO50
    • ICT Awards
      • Dubai 2025
      • Saudi Arabia
    • Cyber Strategists Summit
    • Cloud Connect 2025
    • All events
  • Digital Magazine
  • GITEX GLOBAL
No Result
View All Result
CXO Insight Middle East
No Result
View All Result

Abu Dhabi’s G42 sets new benchmark for Arabic NLP innovation

by CXO Staff
August 5, 2024
in Future, News, Tech

Developed by Inception, a G42 company, this extensive suite of AI models sets a new benchmark in Arabic text processing and reasoning

LLM

Inception, a subsidiary of G42, has released JAIS 70B, a new large language model (LLM), which is aimed at enhancing capabilities in customer service, content creation, and data analysis.

With its 70 billion parameters, it brings unparalleled Arabic-English bilingual capabilities to the open-source community. The model’s development involved extensive training on a dataset of 370 billion tokens, 330 billion of which were Arabic—the largest Arabic dataset ever used for training an open-source foundational model.

Dr. Andrew Jackson, CEO of Inception, said, “AI is now a proven value-adding force, and large language models have been at the forefront of the AI adoption spike. JAIS was created to preserve Arabic heritage, culture, and language, and to democratise access to AI. Releasing JAIS 70B and this new family of models reinforces our commitment to delivering the highest quality AI foundation model for Arabic speaking nations.”

A comprehensive suite of models

In addition to JAIS 70B, Inception has unveiled a comprehensive suite of JAIS models designed to cater to a wide range of applications. This suite includes 20 models across eight different sizes, ranging from 590 million to 70 billion parameters. These models have been specifically fine-tuned for chat applications and trained on up to 1.6 trillion tokens, incorporating Arabic, English, and code data.

Neha Sengupta, Principal Applied Scientist at Inception, said, “For models up to 30 billion parameters, we successfully trained JAIS from scratch, consistently outperforming adapted models in the community. However, for models with 70 billion parameters and above, the computational complexity and environmental impact of training from scratch were significant. We made a choice to build JAIS 70B on the Llama2 model, allowing us to leverage the extensive knowledge base of an existing English model and develop a more efficient and sustainable solution.”

According to the company, JAIS 70B not only matches but in specific cases exceeds the high-quality English-language processing capabilities of Llama2, particularly excelling in Arabic outputs. The development team enhanced the tokeniser based on the Llama2 tokeniser to double the model’s base vocabulary, making Arabic text processing more efficient. According to Sengupta, this adjustment “splits Arabic words less aggressively and makes training and inferencing cheaper” than the standard Llama2 model.

Inception’s JAIS 70B follows the successful launches of JAIS-13B and JAIS-30B models, which have already set high benchmarks in the field. With JAIS 70B, Inception continues to push the boundaries of what’s possible in AI, particularly for under-served languages.

Researchers, developers, and businesses interested in leveraging the capabilities of JAIS 70B can download the models and access the technical paper and benchmarking results by visiting the dedicated page on Hugging Face: JAIS on Hugging Face.

Tags: Abu Dhabiarabic NLPfeaturedG42LLMNLPUAE
ShareTweet

Related Posts

Covoro YouCloud unveils Agentic AI UAE E-Invoicing solution at Tax Technology Summit
Banking and Finance

Covoro YouCloud unveils Agentic AI UAE E-Invoicing solution at Tax Technology Summit

December 5, 2025

Covoro YouCloud, a strategic joint venture formed to accelerate digital tax transformation across the GCC, announced its participation as a...

Human error fuels breaches as only half of professionals receive cybersecurity training, Kaspersky finds
Future

Human error fuels breaches as only half of professionals receive cybersecurity training, Kaspersky finds

December 5, 2025

A recent Kaspersky survey in the Middle East, Turkiye and Africa (META) region entitled “Cybersecurity in the workplace: Employee knowledge...

Discussion about this post

Latest Issue

Is your IT estate holding your organisation back from fully embracing AI?

Is your IT estate holding your organisation back from fully embracing AI?

December 6, 2025
Covoro YouCloud unveils Agentic AI UAE E-Invoicing solution at Tax Technology Summit

Covoro YouCloud unveils Agentic AI UAE E-Invoicing solution at Tax Technology Summit

December 5, 2025
Human error fuels breaches as only half of professionals receive cybersecurity training, Kaspersky finds

Human error fuels breaches as only half of professionals receive cybersecurity training, Kaspersky finds

December 5, 2025

The most trusted source of strategic intelligence for IT decision makers in the Middle East.

About

  • About Us
  • Advertising
  • Digital Magazine
  • Supplements
  • Media Pack
  • Contact Us

Policies

  • Privacy Policy
© 2025 – CXO Insight Middle East. All Rights Reserved.
Facebook-f X-twitter Linkedin
Separated they live in Bookmarksgrove right at the coast of the Semantics, a large language ocean. A small river named Duden.

About

  • About Us
  • Site Map
  • Contact Us
  • Career

Policies

  • Help Center
  • Privacy Policy
  • Cookie Setting
  • Term Of Use

Join Our Newsletter

© 2024 – CXO Insight Middle East. All Rights Reserved.

Facebook-f Twitter Youtube Instagram

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Join our mailing list
Sign up here to get the latest news, updates and special offers delivered directly to your inbox.
No Result
View All Result
  • News
  • Opinions
  • Business
    • Industries
      • Transport
      • Retail
      • Government
      • Real Estate
      • Education
      • Energy
      • Banking and Finance
  • Channel
  • Future
    • Tech
    • Gadgets
    • Science
    • Space
    • Sustainability
  • Events
    • Channel Awards
      • 2025
      • 2024
      • 2023
    • Channel Insights Summit 2025
    • Insight Innovation Summit
    • CX50 Oman
    • CXO50
    • ICT Awards
      • Dubai
      • Saudi Arabia
    • Cyber Strategists Summit
    • Cloud Connect 2025
    • All events
  • Videos
  • GITEX GLOBAL
  • Digital Magazine

© 2025 - CXO Insight Middle East. All Rights Reserved.