Course home for
LING 1340/2340
HOME
• Policies
• Term project guidelines
• Learning resources by topic
• Schedule table
*Class schedule is subject to revision throughout the semester.
W | Date | Due (before class @ 3:45pm) | Topics Tools |
|
#To-do/Homework Project |
||||
1 | 1/11 | [slides] Course introduction, setup | ||
1/13 | #1 | [slides] Data in linguistics | ||
2 | 1/18 | Homework 1: Explore linguistic data | [slides] Processing linguistic data | |
1/20 | #2 | Data processing fundamentals, Statistics | [slides, JNB] Python's numpy library | |
3 | 1/25 | #3 | [slides, JNB] Data frames with pandas | |
1/27 | #4 | [slides] More pandas, text processing, stats | ||
4 | 2/1 | [JNB] Stats crash course, visualization | ||
2/3 | Homework 2: Process the ETS corpus (1st half) | [JNB] Stats continued, HW2 review | ||
5 | 2/8 | HW2 (2nd half) | [JNB] HW2 review (ctd) | |
2/10 | #5 (due @2pm!!) | Open access & data publishing, Corpora, Annotation, Data mining | Guest speaker Lauren Collister | |
6 | 2/15 | [slides, JNB] Corpora: data formats, mining web & social media | ||
2/17 | #6 | [slides] Linguistic annotation | ||
7 | 2/22 | #7 | [slides] Annotation continued | |
2/24 | 1st progress report (due noon 27th) |
Machine learning | [JNB] Regression | |
8 | 3/1 | #8 | [JNB] Classifiers: NB, count vectors, TF-IDF | |
3/3 | #9 | [JNB] SVC, categorical data, cross-validation | ||
No class: Spring break | ||||
9 | 3/15 | Homework 3: Machine Learning with ETS data | ML (ctd) | [slides, JNB, JNB] HW 3 review |
3/17 | #10 | [JNB] HW 3 review | ||
10 | 3/22 | #11 | Big data at CRC, and Machine learning (ctd), and Advanced NLP | [slides, JNB] HW3 wrap: dimensionality reduction, ensemble model, command line |
3/24 | [slides] Supercomputing, command line tools | |||
11 | 3/29 | #12 | [slides] PCA by Sean Steinle, running jobs on CRC | |
3/31 | #13 | [slides, JNB] Computational efficiency, big data wrangling, OnDemand on CRC | ||
12 | 4/5 | Homework 4: Supercomputing Yelp Data | [JNB, JNB] Clustering & topic modeling, advanced NLP | |
4/7 | #14 | Speech & multimedia | [slides] Speech data and corpora | |
13 | 4/12 | [slides, JNB] Forced aligner, ASR | ||
4/14 | [slides, slides] ELAN demo by Lindsey Rojtas, ASR theory, Misha | |||
14 | 4/19 | Ben, Kinan, Man Ho, Alejandro | ||
4/21 | Caroline, Emma, Rohan, Tianyi | |||
15 | 5/1 6pm |
Finals week |