Welcome to GriCo homepage. GriCo is a annotated corpus containing the materials published on the Movimento 5 Stelle blog, created and maintained by a multidisciplinary team. It is available in different formats and with different types of annotations.
Available as annotated XML, including text’s details from the website.
Contains all materials from 2005 to 2018.
The metadata structure preserves the original materials’ hierarchy.
Choose between the version tagged with TreeTagger or spacy.io.
≈ 7mln tokens for posts, ≈ 440mln tokens for comments.
Also available in plain .txt format.
html
) from ilblogdellestelle.itHuge range of potential uses for the GriCo corpus as it covers more than 10 years and includes both blog posts and comments. #CLconf2019 pic.twitter.com/LLN7nJjgB0
— Charlotte Taylor (@_ctaylor_) July 25, 2019
What a great way to start the day!!! Great presentation on the use of #CorpusLinguistics to study political discourse!! Thank you @angela_zottola @Fab_Hunter @elena_valvason @genderedform @matteodic Virgina Zorzi #CLconf2019 pic.twitter.com/DH440Cjz5k
— Debbie Cabral (@debbie_cab) July 25, 2019
Today starts with learning about the new GriCo corpus - one of the first covering Italian polical discourse. Designed and built by @Fab_Hunter@angela_zottola @genderedform#clconf2019 @matteodic Virginia Zorzi & Elena Valvason pic.twitter.com/pYLlrOllI9
— Charlotte Taylor (@_ctaylor_) July 25, 2019