Thursday, 20 Nov 2025
  • Contact
  • Privacy Policy
  • Terms & Conditions
  • DMCA
logo logo
  • World
  • Politics
  • Crime
  • Economy
  • Tech & Science
  • Sports
  • Entertainment
  • More
    • Education
    • Celebrities
    • Culture and Arts
    • Environment
    • Health and Wellness
    • Lifestyle
  • 🔥
  • Trump
  • VIDEO
  • House
  • White
  • ScienceAlert
  • Trumps
  • Watch
  • man
  • Health
  • Season
Font ResizerAa
American FocusAmerican Focus
Search
  • World
  • Politics
  • Crime
  • Economy
  • Tech & Science
  • Sports
  • Entertainment
  • More
    • Education
    • Celebrities
    • Culture and Arts
    • Environment
    • Health and Wellness
    • Lifestyle
Follow US
© 2024 americanfocus.online – All Rights Reserved.
American Focus > Blog > Tech and Science > Open source LLMs hit Europe’s digital sovereignty roadmap
Tech and Science

Open source LLMs hit Europe’s digital sovereignty roadmap

Last updated: February 16, 2025 7:20 am
Share
Open source LLMs hit Europe’s digital sovereignty roadmap
SHARE

Large language models (LLMs) have recently taken center stage on Europe’s digital sovereignty agenda with the launch of a new program called OpenEuroLLM. This initiative aims to develop a series of open source LLMs covering all European Union languages, including the official 24 EU languages and languages for countries seeking entry into the EU market like Albania. The project, co-led by Jan HajiÄŤ and Peter Sarlin, involves collaboration between 20 organizations and is part of Europe’s broader push for digital sovereignty.

The OpenEuroLLM project has a budget of €37.4 million, with funding coming from the EU’s Digital Europe Programme. Partners include EuroHPC supercomputer centers in several European countries. Despite the ambitious goal of developing multilingual LLMs, some have raised concerns about the project’s feasibility due to the involvement of multiple organizations with different priorities.

Jan HajiÄŤ, who is also coordinating the High Performance Language Technologies (HPLT) project, sees OpenEuroLLM as a continuation of HPLT with a focus on generative LLMs. The project aims to release the first versions by mid-2026 and final iterations by 2028. While starting from scratch in terms of data and tools, the project benefits from the expertise of its partners.

Participating organizations include academic and research institutions from Czechia, the Netherlands, Germany, Sweden, Finland, and Norway, as well as corporate entities like Silo AI, Aleph Alpha, Ellamind, Prompsit Language Engineering, and LightOn. Notably absent from the list is Mistral, a French AI company known for its open source approach. While efforts were made to involve Mistral in the project, discussions did not progress.

See also  Learner Tien and Alex Michelsen’s Australian Open is a milestone for American’s men’s tennis

The project’s ultimate goal is to create foundation models for transparent AI in Europe that preserve linguistic and cultural diversity. This includes developing a core multilingual LLM for general-purpose tasks and smaller, more efficient versions for edge applications. Detailed plans for the project’s deliverables are still in development, with a focus on balancing size and quality. The OpenEuroLLM project is striving to create a large language model that is proficient in all languages, with a particular focus on ensuring equality across the board. However, achieving this goal may be challenging, especially for languages with limited digital resources. To address this, the project is working on establishing true benchmarks that are representative of each language and its cultural nuances.

One of the key components of the project is the data it utilizes. The HPLT project has released version 2.0 of its dataset, which includes 4.5 petabytes of web crawls and over 20 billion documents. Additionally, data from Common Crawl, an open repository of web-crawled data, will be incorporated into the mix to further enhance the model’s training.

In the realm of open source AI, there has been a debate about what constitutes true openness. While the Open Source Initiative has defined open source AI, there are differing opinions on whether training data should be included in the definition. The OpenEuroLLM project aims to be as open as possible, but certain limitations may require them to keep some training data confidential, although it will be accessible for auditing purposes as per EU regulations.

Despite its commitment to openness, the OpenEuroLLM project has faced criticism for similarities to the EuroLLM project, which launched earlier in Europe with EU funding. The two projects share common goals of creating open source language models for European languages, but due to funding restrictions, collaborations between the two may be limited.

See also  How Do You Weigh a Black Hole?

In terms of funding, the OpenEuroLLM project is confident that it will have sufficient resources to support its goals. Partnering with EuroHPC centers, which have invested billions in AI and compute infrastructure, will provide the necessary funding for the project. The focus of the project is on building foundational models rather than consumer or enterprise-grade products, which helps streamline the budget allocation.

Overall, the OpenEuroLLM project is dedicated to creating a high-quality, open source language model that can serve as a foundational AI infrastructure for companies in Europe. With a strong focus on data quality, cultural representation, and collaboration within the EU, the project aims to make significant strides in the field of AI language models. The upcoming Europa models from OpenEuroLLM are set to revolutionize language processing in Europe. These new models will support all European languages, building upon the foundation laid by the current models that already cover a handful of European languages. This advancement aligns with the vision of not starting from scratch, as emphasized by HajiÄŤ, who recognizes the existing expertise and technology in place.

Critics have pointed out the complexity of OpenEuroLLM, but HajiÄŤ views this as a positive aspect. He believes that collaborative projects, leveraging both academic expertise and industry focus, can bring about innovative solutions. The goal is not to compete with Big Tech or billion-dollar AI startups, but to achieve digital sovereignty through (mostly) open foundation LLMs developed by and for Europe.

HajiÄŤ emphasizes the importance of having a European-based model, even if it may not be the top performer globally. The focus is on creating a model that encompasses all necessary components within Europe, ensuring a positive outcome regardless of rankings. This commitment to digital sovereignty sets OpenEuroLLM apart and paves the way for a new era of language processing technology in Europe.

See also  Karen Read told to 'calm down, stop talking' after saying 'I hit him' following Boston cop boyfriend's death
TAGGED:DigitalEuropeshitLLMsOpenroadmapsourcesovereignty
Share This Article
Twitter Email Copy Link Print
Previous Article ‘Brand New Life’ and ‘Man From Nowhere’ Star Was 24 ‘Brand New Life’ and ‘Man From Nowhere’ Star Was 24
Next Article Trump is shocking official Washington. Will he leave his mark on the District too? Trump is shocking official Washington. Will he leave his mark on the District too?
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Popular Posts

Scooter Braun Struggled With Suicidal Ideation After Drama, Divorce

Scooter Braun, the renowned music manager, recently shared his struggles with suicidal ideation during a…

June 9, 2025

Deal of the Day: Save 20% With The Container Store

Teachers are always looking for ways to stay organized in the classroom, and the Container…

August 5, 2025

Another Hoax Goes Up in Smoke: MSNBC Forced to Issue Retraction Live On-Air After Guest Floats Bogus Claim About Kash Patel (VIDEO) |

Another Day, Another Fabrication The latest narrative spun around former FBI Director Kash Patel has…

May 5, 2025

Chilling doorbell footage captures trio in Halloween masks threaten to kill widow, family

A terrifying incident unfolded in Virginia when a trio of individuals wearing Halloween masks targeted…

October 17, 2025

Free Holiday Gift Tags To Create Inexpensive Student Gifts

Write an new detailed article from If the holiday rush has caught you by surprise,…

October 29, 2025

You Might Also Like

Quantum Teleportation Achieved Between Photons, Scientists Confirm : ScienceAlert
Tech and Science

Quantum Teleportation Achieved Between Photons, Scientists Confirm : ScienceAlert

November 20, 2025
CDC Vaccine Website Promotes Antiscience Claims of Autism Ties
Tech and Science

CDC Vaccine Website Promotes Antiscience Claims of Autism Ties

November 20, 2025
Common type of inflammatory bowel disease linked to toxic bacteria
Tech and Science

Common type of inflammatory bowel disease linked to toxic bacteria

November 20, 2025
Grok says Elon Musk is better than basically everyone, except Shohei Ohtani
Tech and Science

Grok says Elon Musk is better than basically everyone, except Shohei Ohtani

November 20, 2025
logo logo
Facebook Twitter Youtube

About US


Explore global affairs, political insights, and linguistic origins. Stay informed with our comprehensive coverage of world news, politics, and Lifestyle.

Top Categories
  • Crime
  • Environment
  • Sports
  • Tech and Science
Usefull Links
  • Contact
  • Privacy Policy
  • Terms & Conditions
  • DMCA

© 2024 americanfocus.online –  All Rights Reserved.

Welcome Back!

Sign in to your account

Lost your password?