Data Science with R - Capstone Project
In this capstone course, you will apply various data science skills and techniques that you have learned as part of the previous courses in the IBM Data Science with R Specialization or IBM Data Analytics with Excel and R Professional Certificate.For this project, you will assume the role of a Data Scientist who has recently joined an organization and be presented with a challenge that requires data collection, analysis, basic hypothesis testing, visualization, and modeling to be performed on real-world datasets. You will collect and understand data from multiple sources, conduct data wrangling and preparation with Tidyverse, perform exploratory data analysis with SQL, Tidyverse and ggplot2, model data with linear regression, create charts and plots to visualize the data, and build an interactive dashboard.
The project will culminate with a presentation of your data analysis report, with an executive summary for the various stakeholders in the organization.
Write a web scraping program to extract data from an HTML file using HTTP requests and convert the data to a data frame.
Prepare data for modelling by handling missing values, formatting and normalizing data, binning, and turning categorical values into numeric values.
Interpret datawithexploratory data analysis techniques by calculating descriptive statistics, graphing data, and generating correlation statistics.
Build a Shiny app containing a Leaflet map and an interactive dashboard then create a presentation on the project to share with your peers.
Syllabus
Syllabus - What you will learn from this course
Week 1
Module 1 - Capstone Overview and Data Collection
Week 2
Module 2 - Data Wrangling
Week 3
Module 3: Performing Exploratory Data Analysis with SQL, Tidyverse & ggplot2
At this stage of the Capstone Project, you have gained some valuable working knowledge of data collection and data wrangling. You have also learned a lot about SQL querying and visualization. Congratulations! Now it’s time to apply some of your new knowledge and learn about Exploratory Data Analysis (EDA) techniques, again through practice. You can use the datasets you wrangled in the previous Module. However, if you had any issues completing the wrangling, no worries - we have prepared some clean datasets for you to use. You will be asked to complete three labs:
Week 4
Module 4: Predictive Analysis
Week 5
Module 5 - Building a R Shiny Dashboard App
Week 6
Module 6 - Present Your Data-Driven Insights
FAQ
When will I have access to the lectures and assignments?
Access to lectures and assignments depends on your type of enrollment. If you take a course in audit mode, you will be able to see most course materials for free. To access graded assignments and to earn a Certificate, you will need to purchase the Certificate experience, during or after your audit. If you don't see the audit option:
The course may not offer an audit option. You can try a Free Trial instead, or apply for Financial Aid.
The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Certificate?
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.
Reviews