DiCI-A: Dictionary of Italian Collocations for Learners

SSD: L-LIN/01

ERC sectors:

SH4_9 Theoretical linguistics; computational linguistics

SH4_8 Language learning and processing - first and second languages

Duration:

October 2023 to February 2026

The main objective of the project is to create and publish the DiCI-A (Dictionary of Italian Collocations for Learners), a collocation dictionary designed specifically for learners of Italian as a second language (L2). The DiCI-A will be a freely accessible digital dictionary, searchable through an interface that works on both desktop and mobile devices. The project aims to fill a gap in Italian lexicography, which currently offers no collocation resource that is designed specifically for learners, based on balanced written and spoken corpora, built with rigorous statistical and lexicographic methods, and graded by proficiency level.

Some features of the dictionary are:

it is corpus-based (the collocations are extracted from a reference corpus of contemporary Italian);
it is based on statistical methodologies (collocations are filtered and ordered based on their frequency, their dispersion across the different textual genres in the corpus, and specific association measures);
each collocation included in the DiCI-A is assigned to a level of the Common European Framework of Reference (CEFR), and is accompanied by a definition and authentic examples;
the structure of the dictionary conforms to the international standards of digital lexicography;
usability is ensured through a mobile application, which will allow learners to consult the dictionary in a guided and intuitive way, either individually or in the classroom. The dictionary will also be available through a web platform.
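
As an illustration of the statistical side of these features, the sketch below computes one possible association measure and one possible dispersion measure for a word pair. The project description does not name the measures it uses, so pointwise mutual information (PMI) and a simple genre range are assumptions chosen for the example, and all counts and genre names are invented.

```python
import math

def pmi(pair_freq, w1_freq, w2_freq, corpus_size):
    """Pointwise mutual information (base 2) of a word pair.

    Compares the observed pair frequency with the frequency expected
    if the two words were statistically independent.
    """
    expected = w1_freq * w2_freq / corpus_size
    return math.log2(pair_freq / expected)

def genre_range(freq_by_genre):
    """Share of genres in which the pair occurs at least once."""
    attested = sum(1 for freq in freq_by_genre.values() if freq > 0)
    return attested / len(freq_by_genre)

# Invented counts, for illustration only (not taken from the project's corpus).
score = pmi(pair_freq=50, w1_freq=1000, w2_freq=2000, corpus_size=1_000_000)
spread = genre_range({"news": 30, "fiction": 15, "speech": 5, "academic": 0})
```

A pair that scores high on association but is attested in a single genre would look strong on `score` and weak on `spread`, which is why frequency, dispersion, and association are considered together.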

Activities / Project phases:

The project includes the following phases:

WP1 – Project start and administrative management. Review of previous literature on collocation dictionaries.

Responsible: UnistraPG.

WP2 – Selection of criteria for extracting candidate collocations to be included in the dictionary. In particular:

the syntactic structure of the combinations to be extracted from the reference corpus;

the reference corpus to be used;

the statistical measures and threshold values to be used for extracting candidate collocations.

Responsible: UnistraPG.
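
To make the first criterion concrete, here is a minimal sketch of how candidate combinations might be pulled from a part-of-speech tagged sentence. The pattern list, the tag names (Universal Dependencies style), and the function `candidate_pairs` are hypothetical; the project's actual syntactic structures are not listed here. For simplicity the sketch matches adjacent words only, whereas a real extraction would more likely rely on syntactic relations.

```python
# Hypothetical syntactic patterns; the project's actual list is not given here.
PATTERNS = {("VERB", "NOUN"), ("ADJ", "NOUN"), ("NOUN", "ADJ")}

def candidate_pairs(tagged_sentence, patterns=PATTERNS):
    """Return adjacent (lemma, lemma) pairs whose POS tags match a pattern."""
    pairs = []
    for (lemma1, pos1), (lemma2, pos2) in zip(tagged_sentence, tagged_sentence[1:]):
        if (pos1, pos2) in patterns:
            pairs.append((lemma1, lemma2))
    return pairs

# A lemmatised, tagged sentence written by hand for the example.
sentence = [("prendere", "VERB"), ("nota", "NOUN"), ("di", "ADP"), ("tutto", "PRON")]
candidates = candidate_pairs(sentence)
```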

WP3 – Dictionary design: entry structure, graphical and computational interface.

Responsible: UniPG.
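
A dictionary entry could be modelled along the following lines. This is only a sketch based on the features listed above (a CEFR level, a definition, and authentic examples for each collocation): the class name, field names, and sample values are hypothetical, and the sketch does not attempt to follow any particular lexicographic standard.

```python
from dataclasses import dataclass, field, asdict
import json

@dataclass
class CollocationEntry:
    """Hypothetical entry structure; field names are assumptions."""
    collocation: str
    cefr_level: str
    definition: str
    examples: list[str] = field(default_factory=list)

entry = CollocationEntry(
    collocation="prendere una decisione",
    cefr_level="B1",  # placeholder value, not taken from the DiCI-A
    definition="to make a decision",
    examples=["Devo prendere una decisione entro domani."],
)

# Serialise to JSON, e.g. for delivery to a web or mobile interface.
entry_json = json.dumps(asdict(entry), ensure_ascii=False)
```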

WP4 – Extraction and filtering of collocations through statistical measures: final list of collocations.

Responsible: UnistraPG.
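
The filtering step can be pictured as applying thresholds to each candidate and ordering what remains. The sketch below assumes each candidate already carries a frequency, a dispersion value, and an association score; the dictionary keys, the threshold values, and the sample figures are all invented for illustration.

```python
def filter_and_rank(candidates, min_freq, min_range, min_score):
    """Keep candidates that meet every threshold, sorted by score (highest first)."""
    kept = [
        c for c in candidates
        if c["freq"] >= min_freq
        and c["range"] >= min_range
        and c["score"] >= min_score
    ]
    return sorted(kept, key=lambda c: c["score"], reverse=True)

# Invented values, for illustration only.
candidates = [
    {"pair": ("prendere", "decisione"), "freq": 120, "range": 1.0, "score": 6.2},
    {"pair": ("fare", "domanda"), "freq": 300, "range": 0.75, "score": 7.1},
    {"pair": ("alto", "rischio"), "freq": 4, "range": 0.25, "score": 9.0},
]
shortlist = filter_and_rank(candidates, min_freq=10, min_range=0.5, min_score=3.0)
```

In this example the third candidate has the highest score but is dropped because it is too rare and too narrowly distributed.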

WP5 – Assignment of collocations to CEFR levels, and evaluation.

Responsible: UnistraPG.

WP6 – Addition of definitions and examples, and testing of their functionality.

Responsible: UnistraPG.

WP7 – Creation of the mobile and desktop application, and quality and interface testing.

Responsible: UniPG.

WP8 – Dissemination of results.

Responsible: UnistraPG.

Coordinator:

Prof. Stefania SPINA

Team:

Prof. Stefania SPINA

Dr. Irene Fioravanti

Dr. Luciana Forti

Dr. Fabio Zanda (until October 2025)
