Differences
This shows you the differences between two versions of the page.
tutorials:b4b [2019/11/04 17:32] – [Why is bash relevant?] albarron | tutorials:b4b [2019/11/05 17:34] – [Exercises] albarron | ||
---|---|---|---|
Line 15: | Line 15: | ||
* **Linux**. Nothing extra. You are ready to go. | * **Linux**. Nothing extra. You are ready to go. | ||
+ | ===== Resources ===== | ||
+ | |||
+ | We'll use a small subset of the English-Italian part of the Europarl parallel corpus. | ||
+ | |||
+ | Download the two files here: {{: | ||
===== Why is bash relevant? ===== | ===== Why is bash relevant? ===== | ||
Line 40: | Line 45: | ||
Files can simply be displayed (without any modification) or opened for editing. You will learn to do both. | Files can simply be displayed (without any modification) or opened for editing. You will learn to do both. | ||
- | Commands: '' | + | Commands: '' |
+ | |||
==== Grabbing information in a file from the command line ==== | ==== Grabbing information in a file from the command line ==== | ||
Line 51: | Line 58: | ||
All the operations carried out so far show their result in the terminal, but they neither alter the file contents nor store the result anywhere. Now we learn how to store it. | All the operations carried out so far show their result in the terminal, but they neither alter the file contents nor store the result anywhere. Now we learn how to store it. | ||
- | Commands: | + | Commands: ''>'', |
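As a minimal sketch of storing a result instead of only displaying it, the snippet below redirects the output of two commands into a file. The file names ''sample.en'' and ''excerpt.en'' are hypothetical stand-ins, and the ''>>'' (append) operator is shown as an assumption alongside ''>''; it may or may not be among the commands this section lists.

```shell
# Work in a throwaway directory so nothing else is touched.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical sample file standing in for one side of the corpus.
printf 'first line\nsecond line\nthird line\n' > sample.en

# '>' creates the target file, or overwrites it if it already exists.
head -n 2 sample.en > excerpt.en

# '>>' appends to the target file instead of overwriting it.
tail -n 1 sample.en >> excerpt.en

# Display the stored result.
cat excerpt.en
```

Note that the original file ''sample.en'' is never modified: only the redirection target changes.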
==== Understanding the structure of the commands ==== | ==== Understanding the structure of the commands ==== | ||
Line 67: | Line 74: | ||
Commands: '' | Commands: '' | ||
+ | |||
+ | ==== Exercises ==== | ||
+ | |||
+ | **EXERCISE 1**. Let us " | ||
+ | |||
+ | **EXERCISE 2**. Shuffle a parallel corpus in order to have sentences from different speeches. | ||
+ | |||
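One possible way to approach Exercise 2 is sketched below; it is not necessarily the intended solution. The idea is to glue the two sides together line by line, shuffle the glued lines, and split them again, so that each sentence stays next to its translation. The file names are hypothetical, the sketch assumes that no sentence contains a tab character, and ''shuf'' is assumed to be available (it is part of GNU coreutils and may be missing on some systems).

```shell
# Work in a throwaway directory with a tiny hypothetical parallel corpus.
workdir=$(mktemp -d)
cd "$workdir" || exit 1
printf 'hello\ngood morning\nthank you\n' > corpus.en
printf 'ciao\nbuongiorno\ngrazie\n' > corpus.it

# Join line i of both files with a tab, then shuffle the joined lines.
# Assumption: the sentences themselves contain no tab characters.
paste corpus.en corpus.it | shuf > shuffled.tsv

# Split the shuffled pairs back into one file per language.
cut -f 1 shuffled.tsv > shuffled.en
cut -f 2 shuffled.tsv > shuffled.it

# Show the shuffled pairs side by side.
paste shuffled.en shuffled.it
```

Shuffling each file separately would break the alignment, which is why the two sides are shuffled as a single tab-separated file.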
+ | **EXERCISE 3**. Find the most frequent tokens in the two parts of a parallel corpus and analyse them. | ||
+ | |||
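For Exercise 3, a common pipeline is sketched below under a strong simplifying assumption: tokens are whatever is separated by spaces, so punctuation attached to a word counts as part of it. The file name ''sample.en'' is hypothetical; the same pipeline would be run on each side of the corpus.

```shell
# Work in a throwaway directory with a tiny hypothetical text.
workdir=$(mktemp -d)
cd "$workdir" || exit 1
printf 'the cat and the dog\nthe end\n' > sample.en

# 1. tr: put every space-separated token on its own line.
# 2. sort: bring identical tokens together (uniq needs sorted input).
# 3. uniq -c: collapse repeats and prefix each token with its count.
# 4. sort -rn: order by count, most frequent first.
tr -s ' ' '\n' < sample.en | sort | uniq -c | sort -rn > counts.en

# Show the ten most frequent tokens.
head -n 10 counts.en
```

Each line of ''counts.en'' holds a count followed by a token, so the top of the file is where the analysis of the most frequent tokens would start.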
+ | **EXERCISE 4**. Extract from a TSV dictionary all the words that are cognates with respect to Italian. Afterwards, count the number of tokens that belong to each family. | ||
+ |