Undergraduate Certificate in Computational Linguistics
Program Overview
The Undergraduate Certificate in Computational Linguistics at San Francisco State University provides students with a foundation in computational language analysis. Through coursework in Python programming, statistical methods, and machine learning, students learn to analyze and process raw texts, identify linguistic patterns, extract meaning, and perform data mining. The program prepares students for careers in fields that apply computational linguistics techniques.
Program Outline
Degree Overview:
The Undergraduate Certificate in Computational Linguistics at San Francisco State University provides academic training in computational approaches to language analysis. It assumes no prior knowledge of linguistics or programming and introduces students to a variety of computational methods and their theoretical underpinnings, including:
- Writing programs in Python to process raw texts (tokenization)
- Discovering statistical patterns in linguistic data (frequency distribution)
- Extracting meaning from texts
- Applying various machine learning methods to data mining
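As a rough illustration of the first two items above, the sketch below tokenizes a short text and counts how often each token occurs. It is not taken from the program's course materials: the function names, the sample sentence, and the simple regular-expression tokenizer are assumptions chosen for brevity, and only the Python standard library is used.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens.

    Assumption: a token is a run of letters, optionally followed by an
    apostrophe and more letters (so "don't" stays one token).
    """
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())


def frequency_distribution(tokens):
    """Count how many times each token occurs."""
    return Counter(tokens)


# Hypothetical sample text, used only for illustration.
sample = "The cat sat on the mat. The mat was flat."
tokens = tokenize(sample)
freq = frequency_distribution(tokens)

# Show the two most frequent tokens with their counts.
print(freq.most_common(2))  # [('the', 3), ('mat', 2)]
```

Real coursework would likely handle punctuation, numbers, and non-English text more carefully than this minimal tokenizer does.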
Program Learning Outcomes:
Upon successful completion of the program, students will be able to:
- Identify grammatical categories and basic principles of phonological and syntactic grammar.
- Write programs in a programming language (e.g., Python) and process raw texts.
- Discover statistical patterns in linguistic data, identify frequency distributions, and perform tokenization.
- Build dependency grammars and extract meaning from texts.
- Apply various machine learning methods to data mining.
Other:
The certificate requires 15 units of coursework.