CXO Insight Middle East

Abu Dhabi’s G42 sets new benchmark for Arabic NLP innovation

by CXO Staff
August 5, 2024
in Future, News, Tech

Developed by Inception, a G42 company, this extensive suite of AI models sets a new benchmark in Arabic text processing and reasoning


Inception, a subsidiary of G42, has released JAIS 70B, a new large language model (LLM) aimed at enhancing capabilities in customer service, content creation, and data analysis.

With 70 billion parameters, JAIS 70B brings unparalleled Arabic-English bilingual capabilities to the open-source community. The model was trained on a dataset of 370 billion tokens, 330 billion of which were Arabic, making it the largest Arabic dataset ever used to train an open-source foundational model.

Dr. Andrew Jackson, CEO of Inception, said, “AI is now a proven value-adding force, and large language models have been at the forefront of the AI adoption spike. JAIS was created to preserve Arabic heritage, culture, and language, and to democratise access to AI. Releasing JAIS 70B and this new family of models reinforces our commitment to delivering the highest quality AI foundation model for Arabic speaking nations.”

A comprehensive suite of models

In addition to JAIS 70B, Inception has unveiled a comprehensive suite of JAIS models designed to cater to a wide range of applications. This suite includes 20 models across eight different sizes, ranging from 590 million to 70 billion parameters. These models have been specifically fine-tuned for chat applications and trained on up to 1.6 trillion tokens, incorporating Arabic, English, and code data.

Neha Sengupta, Principal Applied Scientist at Inception, said, “For models up to 30 billion parameters, we successfully trained JAIS from scratch, consistently outperforming adapted models in the community. However, for models with 70 billion parameters and above, the computational complexity and environmental impact of training from scratch were significant. We made a choice to build JAIS 70B on the Llama2 model, allowing us to leverage the extensive knowledge base of an existing English model and develop a more efficient and sustainable solution.”

According to the company, JAIS 70B matches and in specific cases exceeds the English-language processing capabilities of Llama2, while particularly excelling in Arabic outputs. The development team extended the Llama2 tokeniser, doubling the model’s base vocabulary to make Arabic text processing more efficient. According to Sengupta, this adjustment “splits Arabic words less aggressively and makes training and inferencing cheaper” than the standard Llama2 model.
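
The effect Sengupta describes can be illustrated with a toy example. The sketch below is not the JAIS or Llama2 tokeniser, which use learned subword vocabularies. It is a minimal greedy longest-match splitter, and the two vocabularies, the sample phrase, and the function name `greedy_tokenize` are all hypothetical, chosen only to show why a vocabulary that contains more Arabic pieces yields fewer tokens for the same text.

```python
def greedy_tokenize(text, vocab):
    """Split text into the longest pieces found in vocab.

    Falls back to single characters when no longer piece matches,
    so every input can always be tokenised.
    """
    max_len = max(len(piece) for piece in vocab)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking down to one character.
        for length in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + length]
            if length == 1 or piece in vocab:
                tokens.append(piece)
                i += length
                break
    return tokens


# Hypothetical sample: two Arabic words ("the language", "the Arabic").
word_a = "اللغة"
word_b = "العربية"
text = word_a + " " + word_b

# Hypothetical base vocabulary with no Arabic word pieces.
base_vocab = {"the", "model", " "}
# Hypothetical extended vocabulary that adds whole Arabic words.
extended_vocab = base_vocab | {word_a, word_b}

base_tokens = greedy_tokenize(text, base_vocab)
extended_tokens = greedy_tokenize(text, extended_vocab)

print("base vocabulary:    ", len(base_tokens), "tokens")
print("extended vocabulary:", len(extended_tokens), "tokens")
```

With the base vocabulary every Arabic character becomes its own token, while the extended vocabulary keeps each word whole. Fewer tokens per sentence means fewer steps per sentence during both training and inference, which is the cost saving the quote refers to.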

Inception’s JAIS 70B follows the successful launches of the JAIS-13B and JAIS-30B models, which have already set high benchmarks in the field. With JAIS 70B, Inception continues to push the boundaries of what’s possible in AI, particularly for under-served languages.

Researchers, developers, and businesses interested in leveraging the capabilities of JAIS 70B can download the models and access the technical paper and benchmarking results on the dedicated JAIS page on Hugging Face.

Tags: Abu Dhabi, Arabic NLP, featured, G42, LLM, NLP, UAE
© 2024 – CXO Insight Middle East. All Rights Reserved.
