Course home for
LING 1340/2340
HOME 
• Policies
• Term project guidelines
• Learning resources by topic
• Schedule table
*Class schedule is subject to revision throughout the semester.
| W | Date | Due (before class @ 1pm) | Topics Tools |
|
| #To-do/Homework Project |
||||
| 1 | 1/19 | [slides] Course introduction, setup | ||
| 1/21 | #1 | [slides] Data in linguistics | ||
| 2 | 1/26 | Homework 1: Explore linguistic data | [slides] Processing linguistic data | |
| 1/28 | #2 | Data processing fundamentals, statistics | [slides, JNB] Python's numpy library | |
| 3 | 2/2 | #3 | [slides, JNB] Data frames with pandas | |
| 2/4 | #4 | [JNB] More pandas, text processing, stats | ||
| 4 | 2/9 | [JNB] Stats crash course, visualization | ||
| 2/11 | Homework 2: Process the ETS corpus (1st half) | [JNB, JNB] Stats continued, HW2 review | ||
| 5 | 2/16 | #5, HW2 (2nd half) | Open access & data publishing
[slides] Guest speaker Lauren Collister |
|
| 2/18 | Data mining | [JNB, JNB] HW2 review, mining web & social media | ||
| 6 | Self-care day: No class | |||
| 2/25 | #6 | Corpus linguistics, annotation | [slides] Corpora: data formats | |
| 7 | 3/2 | #7 | [slides] Linguistic annotation | |
| 3/4 | Machine learning | [slides, JNB] Annotation Ctd, Regression | ||
| 8 | 3/9 | #8 | [JNB] Classifiers: NB, count vectors, TF-IDF | |
| 3/11 | #9 | [JNB] SVC, categorical data, cross-validation | ||
| 9 | 3/16 | Homework 3: Machine learning with ETS data | [JNB, JNB] Homework 3 review | |
| 3/18 | #10 | [JNB] HW3 continued | ||
| 10 | 3/23 | Big data at CRC, and machine learning (continued), and advanced NLP | [slides] Command-line tools | |
| 3/25 | #11 (new: 1-day extension) | [JNB, slides] Supercomputing at CRC, HW3 wrap: dimensionality reduction, ensemble model | ||
| 11 | 3/30 | #12 | [slides, JNB] Computational efficiency (by Joey), Grid Search and parallel processing | |
| 4/1 | #13 | [JNB, JNB, JNB] Big data wrangling, grid search continued, clustering & topic modeling | ||
| 12 | 4/6 | Homework 4: Supercomputing Yelp Data | [repo, JNB] Homework 4 review, advanced NLP | |
| 4/8 | #14 | Speech & multimedia | [slides] Speech data and corpora | |
| 13 | 4/13 | [slides] ASR theory, forced aligner | ||
| 4/15 | #15 | Project presentations | SC, EM, [slides] Multimodal data | |
| 14 | 4/20 | AC, JP, FH | ||
| 4/22 | LB, MB, ET | |||
| 15 | 5/2 | No class: finals week | ||