| Both sides previous revision Previous revision | |
| corpora:citra:start [2026/04/29 12:13] – dpolizzi | corpora:citra:start [2026/04/29 12:16] (current) – dpolizzi |
|---|
| |
| === Metada scheme === | === Metada scheme === |
| ^ Metadata field ^ Metadata value ^ Description ^ | ^ Metadata field ^ Metadata value ^ Description ^ |
| | id | e.g. 2024_pr_rev_001.txt |file's ID, encapsulating selection criteria | | | id | e.g. 2024_pr_rev_001.txt | file's ID, encapsulating selection criteria | |
| | text type | informative; imaginative |the 1<sup>st</sup> level domain to which the text is assigned based on functional criteria (text types) | | | text type | informative; imaginative | the 1<sup>st</sup> level domain to which the text is assigned based on functional criteria (text types) | |
| | genre | press; general prose; learned writing; fiction |the 2<sup>nd</sup> level domain to which the text is assigned based on structural conventions | | | genre | press; general prose; learned writing; fiction | the 2<sup>nd</sup> level domain to which the text is assigned based on structural conventions | |
| | subgenre | e.g. editorial; novel |the 3<sup>rd</sup> level domain to which the text is assigned based on structural conventions | | | subgenre | e.g. editorial; novel | the 3<sup>rd</sup> level domain to which the text is assigned based on structural conventions | |
| | topic | e.g. politics; fantasy, thriller |the subject matter around which the text is built. Multiple co-occurring topics are separated through a comma| | | topic | e.g. politics; fantasy, thriller | the subject matter around which the text is built. Multiple co-occurring topics are separated through a comma| |
| | publication_year | e.g. 2025 |the year when the text was issued | | | publication_year | e.g. 2025 | the year when the text was issued | |
| | publication_type | print; digital; print and digital |the medium through which the text is made available | | | publication_type | print; digital; print and digital | the medium through which the text is made available | |
| | publisher | e.g. Consiglio Nazionale delle Ricerche |the individual, company or entity producing and distributing the text | | | publisher | e.g. Consiglio Nazionale delle Ricerche | the individual, company or entity producing and distributing the text | |
| | author_gender | man; men; woman; women; mixed; N/a |authors' assumed gender, catering to both individual as well as collaborative production| | | | author_gender | man; men; woman; women; mixed; N/a | authors' assumed gender, catering to both individual as well as collaborative production| | |
| | word_count | e.g. 781 |the total number of tokens from the cleaned text as reported in the text editor | | | word_count | e.g. 781 | the total number of tokens from the cleaned text as reported in the text editor | |
| | url | e.g. https://cineforum.it/recensione/Nosferatu |a link directing to the original text file | | | url | e.g. https://cineforum.it/recensione/Nosferatu | a link directing to the original text file | |
| |
| ==== License and conditions of use ==== | ==== License and conditions of use ==== |