DICI-A Dictionary of Italian Collocations for Learners

SSD: L-LIN/01

ERC Sectors:

SH4_9 Theoretical linguistics; computational linguistics

SH4_8 Language learning and processing - first and second languages

Durata:

October 2023

February 2026

The main goal of the project is to create and make public the DiCI-A (Dictionary of Italian Collocations for Learners), a dictionary of collocations specifically designed for learners of Italian as a second language. The DICI-A aims to be a freely accessible digital dictionary that can be searched through an interface accessible from both desktop and mobile devices. The project seeks to fill a gap in the field of Italian lexicography, which currently does not offer tools for collocations specifically designed for learners, based on balanced written and spoken corpora, and designed according to rigorous statistical and lexicographic methods, in which collocations are graded according to proficiency level.

Some features of the dictionary are:

it is corpus-based (the collocations are extracted from a reference corpus of contemporary Italian);
it is based on statistical methodologies (collocations are filtered and ranked based on their frequency, their dispersion across different textual genres in the corpus, and specific association measures);
each collocation included in the DICI-A is assigned to a level of the Common European Framework of Reference (CEFR), and is accompanied by a definition and authentic examples;
the structure of the dictionary complies with international standards set for digital lexicography;
the usability of the dictionary is ensured through a mobile application, allowing learners to consult it in a guided and intuitive way, either individually or in a classroom learning context. Consultation will also be available through a web platform.

Activities / Project Phases:

The project includes the following phases:

WP1 – Project initiation and administrative management. Analysis of previous literature on collocations dictionaries.

Responsible: UnistraPG.

WP2 – Selection of criteria for extracting candidate collocations to be included in the dictionary. In particular:

the syntactic structure of the combinations to be extracted from the reference corpus;

the reference corpus to be used;

the statistical measures and threshold values to use for extracting the candidate collocations.

Responsible: UnistraPG.

WP3 – Dictionary design: entry structure, graphical and computational interface.

Responsible: UniPG.

WP4 – Extraction and filtering of collocations using statistical measures: final list of collocations.

Responsible: UnistraPG.

WP5 – Assignment of collocations to CEFR levels, and evaluation.

Responsible: UnistraPG.

WP6 – Addition of definitions and examples, and testing their functionality.

Responsible: UnistraPG.

WP7 – Creation of the mobile and desktop application, and interface and quality testing.

Responsible: UniPG.

WP8 – Dissemination of the results.

Responsible: UnistraPG.

Coordinator:

Prof. Stefania SPINA

Team:

Prof. Stefania SPINA

Dr. Irene Fioravanti

Dr. Luciana Forti

Dr. Fabio Zanda (until October 2025)

Navigazione principale

University

Services and opportunities

Educational offer

International mobility

Research

DICI-A Dictionary of Italian Collocations for Learners

ERC Sectors:

Durata:

Activities / Project Phases:

Coordinator:

Team:

Would you like to share this content?