Data Science Fundamentals

Aims and objectives

This unit will give students a solid foundation in contemporary data science best practices using Python. It provides a hands-on introduction to programming paradigms and fundamental data analysis techniques. Working through examples that use real-world data, students will learn data cleaning and validation, data transformation, algorithm design, text analytics, and data visualisation. Students will become familiar with widely used Python libraries such as Pandas, Matplotlib, and the Natural Language Toolkit (NLTK).