Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revisionBoth sides next revision | ||
tutorials:b4b [2019/11/05 15:31] – [Storing the results] eros | tutorials:b4b [2019/11/05 17:16] – [How to find some help] albarron | ||
---|---|---|---|
Line 15: | Line 15: | ||
* **Linux**. Nothing extra. You are ready to go. | * **Linux**. Nothing extra. You are ready to go. | ||
+ | ===== Resources ===== | ||
+ | |||
+ | We'll use a small subset of the English-Italian part of the Europarl parallel corpus. | ||
+ | |||
+ | Download the two files here: {{: | ||
===== Why is bash relevant? ===== | ===== Why is bash relevant? ===== | ||
Line 41: | Line 46: | ||
Commands: '' | Commands: '' | ||
+ | |||
+ | |||
==== Grabbing information in a file from the command line ==== | ==== Grabbing information in a file from the command line ==== | ||
Line 67: | Line 74: | ||
Commands: '' | Commands: '' | ||
+ | |||
+ | ==== Exercises ==== | ||
+ | |||
+ | **EXERCISE 1**. Let us " | ||
+ | |||
+ | **EXERCISE 2**. Shuffle a parallel corpus in order to have sentences from different speeches. | ||
+ | |||
+ | **EXERCISE 3**. Find the most frequent tokens in the two parts of a parallel corpus and analyse them. | ||
+ | |||
+ | **EXERCISE 4**. Get all words which are cognates wrt Italian from a tsv dictionary. | ||
+ |