Welcome to CS250: Python for Data Science
Specific information about this course and its requirements can be found below. For more general information about taking Saylor Academy courses, including information about Community and Academic Codes of Conduct, please read the Student Handbook.
Course Description
Learn data science using the Python programming language by looking at data processing, data analysis, visualization, data mining, and statistical models. By the end of this course, you will be able to implement Python code for these data science topics.
Course Introduction
This course attempts to strike a balance between presenting the vast set of methods within the field of data science and Python programming techniques for implementing them. Problem-solving and programming implementation will be emphasized throughout the course. All techniques presented will be introduced using real-world programming examples. A major goal of the course is to ensure that when you finish the course, you will have the programming and conceptual expertise you need to join the field of data science.
Several Python modules, such as pandas, scikit-learn, scipy.stats, and statsmodels, will be introduced that are useful for data analysis, data visualization, and data mining. The course will gradually shift from introductory topics such as a review of Python, matrix operations, and statistics to applications and implementing programs involving data mining, visualization, statistical models, and time series analysis.
This course includes the following units:
- Unit 1: What is Data Science?
- Unit 2: Python for Data Science
- Unit 3: The numpy Module
- Unit 4: Applied Statistics in Python
- Unit 5: The pandas Module
- Unit 6: Visualization
- Unit 7: Data Mining I – Supervised Learning
- Unit 8: Data Mining II – Clustering Techniques
- Unit 9: Data Mining III - Statistical Modeling
- Unit 10: Time Series Analysis
Course Learning Outcomes
Upon successful completion of this course, you will be able to:
- Use Google Colaboratory notebooks to implement and test Python programs;
- Explain how Python programming is relevant to data science;
- Construct and operate on arrays using the numpy module;
- Apply Python modules for basic statistical computation;
- Construct and operate on dataframes using the pandas module;
- Apply the pandas module to interact with spreadsheet software;
- Implement Python scripts for visualization using arrays and dataframes;
- Apply the scikit-learn module to perform data mining;
- Explain techniques for supervised and unsupervised learning;
- Apply supervised learning techniques;
- Apply unsupervised learning techniques;
- Apply the scikit-learn module to build statistical models;
- Implement Python scripts to perform regression analyses;
- Apply the statsmodels module to build and analyze models for time series analysis; and
- Explain similarities and differences between AR, MA, and ARIMA models.
Throughout this course, you will also see learning outcomes in each unit. You can use those learning outcomes to help organize your studies and gauge your progress.
Course Materials
This course's primary learning materials are articles, lectures, and videos.
All course materials are free to access and can be found in each unit of the course. Pay close attention to the notes that accompany these course materials, as they will tell you what to focus on in each resource and will help you understand how the learning materials fit into the course as a whole. You can also see a list of all the learning materials in this course at this link.
Some parts of this course may have been created or reviewed with the support of artificial intelligence (AI). To make sure you receive accurate, high-quality, and academically sound learning materials, all AI-assisted content is carefully checked and approved by Saylor Academy's faculty and subject matter experts.
Evaluation and Minimum Passing Score
Only the final exam is considered when awarding you a grade for this course. To pass this course, you will need to earn a grade of 70% or higher on the final exam.
Your score on the exam will be calculated as soon as you complete it. Be sure to study in between each attempt! If you do not pass the exam, you will not complete this course or receive a certificate of completion. You can attempt the exam as many times as you want.
There are end-of-unit assessments in this course that are designed to help you study and do not factor into your final course grade. You can take them as many times as you want until you understand the concepts they cover.
You can see all of these assessments at this link.
Continuing Education Credits
The certificate earned by passing this self-paced course displays the program hours you completed and continuing education credits (CEUs). CEUs document successful completion of courses that are designed to improve the knowledge and skills of working adults. Many industries value CEUs, and now your certificate reflects them clearly, and they may be used to support career advancement or to meet professional licensing standards. This course contains 6.7 CEUs.
Tips for Success
CS250: Python for Data Science is a self-paced course, meaning you can decide when to start and complete the course. We estimate the "average" student will take 67 hours to complete. We recommend studying at a comfortable pace and scheduling your study time in advance.
Learning new material can be challenging, so here are a few study strategies to help you succeed:
- Take notes on terms, practices, and theories. This helps you understand each concept in context and provides a refresher for later study.
- Test yourself on what you remember and how well you understand the concepts. Reflecting on what you've learned improves long-term memory retention.
Technical Requirements
This course is delivered entirely online. You will need access to a computer or web-capable mobile device and consistent internet access to view or download resources and complete auto-graded assessments and the final exam.
To access the full course, including assessments and the final exam, log into your Saylor Academy account and enroll in the course. If you don't have an account, you can create one for free here. Note that tracking progress and taking assessments require you to log in.
For more details and guidance, please review our complete Technical Requirements and our student Help Center.
Optional Saylor Academy Mobile App
You can access all course features directly from your mobile browser, but if you have limited internet connectivity, the Saylor Academy mobile app provides an option to download course content for offline use. The app is available for iOS and Android devices.
Fees
This course is entirely free to enroll in and access. All course materials, including textbooks, videos, webpages, and activities, are available at no charge. This course also contains a free final exam and a free course completion certificate.