Skip to content

TurkuNLP/ATP_kurssi

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ATP_kurssi

This page includes all the materials for the course KKLT0030 Automatic text processing 5 credits.

The course Moodle page has private materials, such as possible recordings and announcements.

Mon Oct 27

  • Getting started
  • Notebook 1
  • Commands
    • Getting data and printing stuff: wget, echo
    • Printing files: cat, head, tail
    • Copying, renaming, removing: cp, mv, rm
    • Others: wc -w, ls

Thur Oct 30

  • Notebook2
  • Commands: egrep, sort, uniq
  • Options
    • egrep -v, -i, -w, -c, -B, -A
    • head -n, tail -n
    • wc -l, -w
    • uniq -c, sort -r, -n
  • Pipes, especially frequency counts
    • sort | uniq -c | sort -rn

Mon Nov 3

  • Notebook3 exercises

Thur Nov 6

  • Notebook4
  • Git clone for cloning Github reports
  • Gzipped files using gzip and zcat
  • Changing characters using tr
    • Combining tr to a frequency list pipeline
    • Using tr to normalize
  • Regular expressions

Mon Nov 10

  • Notebook 5 exercies

Thur Nov 13

  • Notebook 6
  • Dependency syntax analysis pipeline
  • Sentence + token segmentation, lemmatisation, POS, dependencies
  • conllu format
  • Universal dependencies treebanks
  • Trankit parser

Mon Nov 17

  • Notebook 7
  • recap

Thur Nov 20

  • Notebook 8
  • Working on the server
  • Directory structure, files and folders

Mon Nov 24

  • Notebook 8 cont'd
  • Scripts
  • Stdin/stdout, arguments

Thur Nov 27

  • Notebook 9
  • Recap of Notebook 8 subjects

Mon Dec 1

  • Notebook 9
  • Perl substitution

Thur Dec 4

  • Notebook 10
  • For loops

Mon Dec 8

  • More for loops
  • Recap

Thur Dec 11

  • FULL RECAP
  • Q&A

Mon Dec 15

  • EXTRA: Python with Bash

Thur Dec 18

  • EXTRA: Python with Bash

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 5