tutorials:b4b

Differences

This shows you the differences between two versions of the page.

tutorials:b4b [2019/11/05 17:16] – [How to find some help] albarron
tutorials:b4b [2019/11/05 17:42] – [Exercises] albarron
Line 77:
 ==== Exercises ====

-**EXERCISE 1**. Let us "measure" the file: bytes, megabytes, lines, words, etc.
+**EXERCISE 1**. Let us "measure" file: bytes, megabytes, lines, words, etc.

 **EXERCISE 2**. Shuffle a parallel corpus in order to have sentences from different speeches.
Line 83:
 **EXERCISE 3**. Find the most frequent tokens in the two parts of a parallel corpus and analyse them.

-**EXERCISE 4**. Get all words which are cognates wrt Italian from a tsv dictionary.
+**EXERCISE 4**. Get all words which are cognates wrt Italian from a tsv dictionary. Afterwards, count the number of tokens which belong to each family
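
Exercise 2 asks for a parallel corpus to be shuffled, which only makes sense if both sides are reordered with the same permutation so that sentence pairs stay aligned. The following is a minimal sketch of that idea in Python, not the tutorial's own solution: the function name `shuffle_parallel`, the `seed` parameter, and the sample sentences are all hypothetical and chosen only for illustration.

```python
import random


def shuffle_parallel(src_lines, tgt_lines, seed=0):
    """Shuffle two aligned lists of sentences with one shared permutation.

    Hypothetical helper: `src_lines[i]` is assumed to be the translation
    of `tgt_lines[i]`, and that pairing is preserved after shuffling.
    """
    if len(src_lines) != len(tgt_lines):
        raise ValueError("both sides must have the same number of lines")

    # Pair the sentences up so they move together, then shuffle the pairs.
    pairs = list(zip(src_lines, tgt_lines))
    random.Random(seed).shuffle(pairs)

    if not pairs:
        return [], []

    # Split the shuffled pairs back into two aligned lists.
    src_shuffled, tgt_shuffled = zip(*pairs)
    return list(src_shuffled), list(tgt_shuffled)


if __name__ == "__main__":
    # Made-up sample data, only to show the call.
    english = ["Good morning.", "Thank you.", "See you tomorrow."]
    italian = ["Buongiorno.", "Grazie.", "A domani."]
    for en, it in zip(*shuffle_parallel(english, italian, seed=42)):
        print(en, "|||", it)
```

Fixing the seed makes the shuffle reproducible; shuffling each side independently would instead destroy the alignment.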
  
  • tutorials/b4b.txt
  • Last modified: 2019/11/06 11:36
  • by albarron