The number of documents available online and in other electronic formats (newspaper articles, digital archives, advertising copy, blogs, etc.) is constantly growing. The rapid growth of digital text material is a valuable source for research, but it also poses a number of challenges for researchers. CLARIN’s mission was to facilitate access to developed language tools and resources and to support the application of language technologies in the humanities and social sciences.
The immediate goal of the CLARIN-PL-BIZ project was to expand the CLARIN-PL research infrastructure into a research and development platform dedicated to natural language processing and the exploration of large-scale linguistic data. This resulted in the creation of a comprehensive infrastructure for building effective and efficient systems for exploring large-scale linguistic data (text and speech). It provides access to universal language technology components and mechanisms for integrating them to build text analysis systems.
The project was carried out by:
- Wrocław University of Technology (including WCSS),
- Institute of Computer Science, Polish Academy of Sciences,
- Institute of Slavic Studies, Polish Academy of Sciences,
- University of Łódź,
- University of Wrocław.