Here is Italia, an all-Italian large language model in the vein of GPT
Presented today and released as open source, free to download, by the Italian company iGenius in collaboration with Cineca
4 min read
Here is Italia, an all-Italian large language model in the vein of GPT. It was presented today and released as open source, free to download, by the Italian company iGenius in collaboration with Cineca (Italy's largest computing center, an inter-university consortium).
Although it is still at version 0.1, Italia stands today as the largest and most accomplished large language model made in Italy, trained on our language and designed to support the growth of Italian companies and public administrations.
In short, the Italian character is present on several levels, as the company explained in today's presentation. It is in the training dataset, which is more than 90 per cent Italian data, with the advantage of a better understanding of our language, its nuances, and our historical and cultural context. According to the company, it also brings an efficiency gain of 60 per cent, because current models, built around English, perform continuous translation when they handle other languages, a step that is invisible to the user.
Italianness is also in the spirit of the product: the objective, declared today, is to help Italy be an active player in this revolution and not a mere consumer of foreign products. This is why Italia is open source: to be an enabling element for the development of the country, its companies and its public administration, without further dependence on foreign products.
The Distinctive Elements of Italia
From a technical point of view, Italia has 9 billion parameters, a context window of 4,096 tokens and a vocabulary of 50,000 tokens. It was trained on trillions of tokens drawn from a heterogeneous mix of sources: public sources, synthetic data and industry content provided by iGenius' commercial partners.
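To give a feel for what these figures mean in practice, here is a rough back-of-the-envelope sketch in Python. Only the three headline numbers come from the presentation; the numeric formats (fp32, fp16, int8), the helper names `weight_memory_gb` and `max_new_tokens`, and the idea that prompt and reply share the same window are illustrative assumptions, not details confirmed by iGenius.

```python
# Headline figures reported for Italia 0.1.
PARAMS = 9_000_000_000      # 9 billion parameters
CONTEXT_WINDOW = 4096       # tokens
VOCAB_SIZE = 50_000         # tokens

# Assumption: common storage formats; the article does not say which one is used.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_memory_gb(params: int, bytes_per_param: int) -> float:
    """Approximate memory needed just to hold the weights, in GB (10^9 bytes)."""
    return params * bytes_per_param / 1e9


def max_new_tokens(prompt_tokens: int, context_window: int = CONTEXT_WINDOW) -> int:
    """Tokens left for the reply, assuming prompt and reply share one window."""
    return max(context_window - prompt_tokens, 0)


if __name__ == "__main__":
    for fmt, size in BYTES_PER_PARAM.items():
        print(f"{fmt}: about {weight_memory_gb(PARAMS, size):.0f} GB of weights")
    print("room left after a 1,000-token prompt:", max_new_tokens(1000))
```

Under these assumptions, the weights alone would take roughly 18 GB in 16-bit precision, and a 1,000-token prompt would leave about 3,000 tokens for the answer. Real deployments also need memory for activations and caches, which this sketch ignores.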


