Course home for LING 1340/2340
Schedule table
*Class schedule is subject to revision throughout the semester.
W | Date | Due (before class @ 1pm): #To-do/Homework, Project | Topics, Tools |
1 | 1/19 | [slides] Course introduction, setup | ||
1/21 | #1 | [slides] Data in linguistics | ||
2 | 1/26 | Homework 1: Explore linguistic data | [slides] Processing linguistic data | |
1/28 | #2 | Data processing fundamentals, statistics | [slides, JNB] Python's numpy library | |
3 | 2/2 | #3 | [slides, JNB] Data frames with pandas | |
2/4 | #4 | [JNB] More pandas, text processing, stats | ||
4 | 2/9 | [JNB] Stats crash course, visualization | ||
2/11 | Homework 2: Process the ETS corpus (1st half) | [JNB, JNB] Stats continued, HW2 review | ||
5 | 2/16 | #5, HW2 (2nd half) | Open access & data publishing | [slides] Guest speaker Lauren Collister | |
2/18 | Data mining | [JNB, JNB] HW2 review, mining web & social media | ||
6 | Self-care day: No class | |||
2/25 | #6 | Corpus linguistics, annotation | [slides] Corpora: data formats | |
7 | 3/2 | #7 | [slides] Linguistic annotation | |
3/4 | Machine learning | [slides, JNB] Annotation continued, regression | ||
8 | 3/9 | #8 | [JNB] Classifiers: NB, count vectors, TF-IDF | |
3/11 | #9 | [JNB] SVC, categorical data, cross-validation | ||
9 | 3/16 | Homework 3: Machine learning with ETS data | [JNB, JNB] Homework 3 review | |
3/18 | #10 | [JNB] HW3 continued | ||
10 | 3/23 | Big data at CRC, machine learning (continued), and advanced NLP | [slides] Command-line tools | |
3/25 | #11 (new: 1-day extension) | [JNB, slides] Supercomputing at CRC, HW3 wrap-up: dimensionality reduction, ensemble model | ||
11 | 3/30 | #12 | [slides, JNB] Computational efficiency (by Joey), Grid Search and parallel processing | |
4/1 | #13 | [JNB, JNB, JNB] Big data wrangling, grid search continued, clustering & topic modeling | ||
12 | 4/6 | Homework 4: Supercomputing Yelp Data | [repo, JNB] Homework 4 review, advanced NLP | |
4/8 | #14 | Speech & multimedia | [slides] Speech data and corpora | |
13 | 4/13 | [slides] ASR theory, forced aligner | ||
4/15 | #15 | Project presentations | SC, EM, [slides] Multimodal data | |
14 | 4/20 | AC, JP, FH | ||
4/22 | LB, MB, ET | |||
15 | 5/2 | No class: finals week |