Advance Course: Applied R in Data Analytics & Forecasting- Project Base
Categories: Advanced Courses
About Course
This course focuses on applying conditional statements for data cleaning, healthcare automation (e.g., disease categorization, patient communication, critical case flagging), and mastering advanced data structures like datatables and tibbles, alongside list, date, and time manipulation. It covers comprehensive regression analysis, including linear, logistic, Poisson, and negative binomial models, emphasizing assumption validation. Additionally, the curriculum delves into time series analysis (trend, seasonality, stationarity, ARIMA/SARIMA forecasting) and various statistical tests such as ANOVA, ANCOVA, and non-parametric methods, all reinforced through practical application-based assessments.
What Will You Learn?
- A significant portion of the course will focus on the practical application of conditional statements, enabling participants to clean data, generate warning messages, categorize diseases, create personalized patient communications, assign medical priority, flag critical cases, and develop treatment plans within healthcare contexts.
- Further extending these skills, learners will design remote patient monitoring systems, perform insurance risk classification, build AI chatbots, and create pharmacy drug recommendation systems using conditional logic; additionally, they will master higher-level data structures such as datatables, tibbles, and advanced list manipulation.
- The curriculum also includes mastering date and time manipulation with lubridate and implementing advanced piping techniques for streamlined data workflows. Participants will delve into various regression analyses, including linear regression with its critical assumptions (linearity, independence of errors, normality of residuals, homoscedasticity, no multicollinearity), binary and multinomial logistic regression, Poisson regression, and negative binomial regression for count data.
- Advanced topics will cover comprehensive time series analysis, focusing on identifying trends, seasonality, and stationarity, alongside practical forecasting using ARIMA and SARIMA models.
- Finally, the training will equip participants with skills in one-way and two-way ANOVA, ANCOVA, and non-parametric tests like Wilcoxon, Wilcoxon Signed-Rank, and Kruskal-Wallis, culminating in extensive application-based assessments to solidify their understanding of advanced data analysis.
Course Content
ORIENTATION VIDEO
-
Orientation
13:32
DOWNLOAD YOUR FILES HERE
-
Files for Practice
MODULE 2 OF 11: R MARKDOWN- ADDING BEAUTY TO YOUR CODE PRESENTATION (Click to see module content)
This series aims to equip users with the skills to leverage R Markdown for creating professional and visually appealing code presentations. Objectives include mastering file management and knitting processes to generate HTML, Word, and PDF documents. Furthermore, the series will cover techniques for enhancing the aesthetic appeal of R Markdown outputs, including image insertion, clickable links, and basic website development principles, alongside efficient handling of code execution, tables, and graphs.
-
File and Folder Creation in R Markdown
20:00 -
How to Knit into HTML and Word Document
20:00 -
How to Knit into PDF Document
23:18 -
Adding Aestheticism to HTML File in R
18:01 -
How to Create a Dashboard in R Markdown
13:05 -
Creating Clickable Link in R Markdown
10:52 -
Image Insertion in R Markdown
09:49 -
Running Code in R Markdown
00:00 -
Working with Table in R Markdown
14:42 -
R-Markdown Test of Knowledge 1
-
R-Markdown Test of Knowledge 2
DOWNLOAD YOUR FILES HERE
-
Files For Hands-On Learning
MODULE 3 OF 11: DATA CLEANING & MANIPULATION (Click to see module content)
-
Data Filtering
48:36 -
Solution to Assignment on Filtering
10:59 -
Dplyr
23:26 -
Data Cleaning 1
45:52 -
Data Cleaning 2
01:31:05 -
Understanding your Dataset- Tobacco
12:49 -
20 Practical-Based Quiz Questions on Data Collection
-
File Loading with Data.Table and Data Frame
17:27 -
Row Filtering with Data.Table and Data Frame
26:25 -
Column Renaming
18:22 -
Column Selection in Data.Table and Data Frame
22:40 -
Column Transformation
21:10 -
50 Practice Multiple Choice Question on DataTable and Dataframe
-
Data Collection for Analysis
MODULE 4 OF 11: QUANTITATIVE DATA ANALYSIS 1 (Click to see module content)
Upon completing this module, participants will be able to effectively analyze and present relationships between variables by calculating and interpreting correlation coefficients in both tabular and visual formats, utilizing ggplot2 for compelling data visualizations.
Also, participants will be able to build, interpret, and validate various regression models in R, including linear, binary logistic, multinomial, Poisson, and negative binomial regressions for count data. They will also gain proficiency in assessing the critical assumptions of linear regression, such as linearity, independence of errors, normality of residuals, homoscedasticity, and the absence of multicollinearity, to ensure robust model performance.
-
Introduction to ggplot for Visualization
28:00 -
Categorical Data Analysis
16:41 -
Chi-Square, One-Sample, Independent and Paired T-Test
47:45 -
Correlation and Interpretation (Visuals and Tabular)
57:27 -
Applied Theory Assignment 1- Applied Scenarios
MODULE 5 OF 11: QUANTITATIVE DATA ANALYSIS 2 (Click to see module content)
Upon completing this module, participants will be able to effectively analyze and present relationships between variables by calculating and interpreting correlation coefficients in both tabular and visual formats, utilizing ggplot2 for compelling data visualizations.
Also, participants will be able to build, interpret, and validate various regression models in R, including linear, binary logistic, multinomial, Poisson, and negative binomial regressions for count data. They will also gain proficiency in assessing the critical assumptions of linear regression, such as linearity, independence of errors, normality of residuals, homoscedasticity, and the absence of multicollinearity, to ensure robust model performance.
-
Linear Regression
30:36 -
Linearity Assumption
20:34 -
Autocorrelation Assumption
11:03 -
Independence Error Assumption
14:22 -
Normality of Residuals Assumption
00:44 -
Homoscedasticity Assumption
15:10 -
No Multicollinearity Assumption
11:38 -
50 Practice Multiple Choice Questions on Linear Regression
-
Binary Logistic Regression
35:58 -
Multinomial Logistic Regression
23:54 -
Poisson Regression
26:21 -
Negative Binomial Regression
17:33 -
Troubleshooting 1: Number of Iteration
03:59 -
Troubleshooting 2: Multicollinearity
06:12 -
Troubleshooting 3: Standardization
05:28 -
Troubleshooting 4: Outlier
08:17 -
Zero Inflation Poisson Model (ZIP Model)
10:29 -
Zero Inflation Negative Binomial (ZINB)
24:11 -
50 Practice Multiple Choice Question and Answer with Explanation
-
Applied Theory Exercise 2: Generalized Linear Models (GLMs)
MODULE 6 OF 11: PARAMETRIC TEST (Click to see module content)
Upon completing these modules, participants will be able to apply various statistical tests in R to compare group means and distributions. They will gain proficiency in performing one-way ANOVA and two-way ANOVA for parametric data, as well as non-parametric tests such as Wilcoxon, Wilcoxon Signed-Rank, and Kruskal-Wallis for non-normally distributed data.
-
Parametric Test 1: One-Sample, Independent and Paired T-Test
47:45 -
Parametric Test 2: One-Way ANOVA
27:10 -
Parametric Test 3: Two-Way ANOVA
20:53 -
40 Practice Questions with Answers and Explanation
-
Quiz, answer and explanation
-
Applied Theory Exercise 3: Parametric Tests
MODULE 7 OF 11: NON-PARAMETRIC TEST (Click to see module content)
Upon completing these modules, participants will be able to apply various statistical tests in R to compare group means and distributions. They will gain proficiency in performing one-way ANOVA, two-way ANOVA, and ANCOVA for parametric data, as well as non-parametric tests such as Wilcoxon, Wilcoxon Signed-Rank, and Kruskal-Wallis for non-normally distributed data.
-
Non-parametric Test Wilcoxon
17:30 -
Wilcoxon Signed-Rank with short note
08:34 -
Non-parametric Test Kruskal-Wallis with short note
11:36 -
50 Practice Questions with Answers and Explanation
-
Applied Theory Exercise 5: Non-Parametric Tests
MODULE 8 OF 11: QUALITATIVE DATA ANALYSIS- QDA (Click to see module content)
This module is designed to equip students with the skills to effectively interpret and analyze unstructured qualitative data. The primary objective is to move beyond mere observation, enabling students to uncover rich insights and meaningful patterns from text, interviews, and open-ended responses. Students will master techniques for identifying key themes and developing robust coding frameworks. Ultimately, this content aims to build proficiency in qualitative research methods, allowing students to support business decisions with evidence-based narratives. Upon completion, students will be able to synthesize complex information and present compelling stories hidden within the data. The final goal is to empower a new generation of data analysts who can provide a deeper, more human-centric understanding of an organization's challenges.
-
Introduction to Qualitative Data Analysis
00:00 -
QDA- Data Loading and Cleaning
39:54 -
QDA- Word Frequency and WordCloud
19:44 -
QDA- Bigram
13:41 -
N-gram Visualization
13:15 -
Lexicon-Based Sentiment Analysis
23:31 -
Topic Modelling
38:45 -
QDA- Concordance
17:41 -
Sentiment by Segmentation
26:54 -
Natural Language Processing (NLP)- Data Preparation
49:57 -
Natural Language Processing (NLP)- Product Description
18:32 -
Natural Language Processing (NLP)- Analyzing Concept
08:45 -
Natural Language Processing (NLP)- Analyzing Activities
10:18 -
50 Practical-Based Quiz Questions on Qualitative Data Analysis in R
-
Real World Mini-project: Qualitative Data Analysis
MODULE 9 OF 11 (ADD-ON): TIME SERIES ANALYSIS AND FORECASTING (Click to see module content)
Upon completing these modules, participants will be able to perform comprehensive time series analysis in R, including identifying and modeling trends, seasonality, and stationarity. They will also gain proficiency in forecasting future values using both ARIMA and SARIMA models.
-
Time Series Analysis- General Overview
27:04 -
Time Series Analysis- Trend
27:04 -
Seasonality 1- Subseries
15:27 -
Seasonality 2- Weekly
12:09 -
Seasonality 3- Autocorrelation
10:17 -
Time Series Analysis- Stationarity
08:55 -
Forecasting with ARIMA Model
17:18 -
Forecasting with SARIMA Model
14:10 -
60 Practical-Based Quiz Questions on Time Series Analysis and Forecasting in R
MODULE 10 OF 11 (ADD-ON): DATA SCIENCE IN HEALTHCARE (Click to see module content)
Upon completing these modules, participants will be able to apply conditional statements in R to automate critical healthcare tasks, including disease categorization, patient message personalization, and medical priority assignment.
-
Disease Categorization
40:33 -
How to Create Personalized Message for Patients/ Clients in R
12:02 -
Assigning Medical Priority Level with Conditional Statement
11:39 -
How to Flag Critical Cases in Healthcare
11:07 -
Flagging Patients who Need Urgent Medical Check-up
09:25 -
Assigning Treatment Plan with R Programming Language
09:53 -
Creating Remote Patient Monitoring System
07:48 -
Risk Classification in Health Insurance
07:27 -
How to Create AI Chat Box
08:36 -
Pharmacy Drug Recommendation System with R
06:39 -
50 Practical-Based Quiz Questions
MODULE 1 OF 11: FOUNDATIONAL COURSE REVISION (Click to see module content)
A foundational course in R programming for data analytics typically covers the essential skills needed to manipulate, analyze, and visualize data using R.
-
Introduction: Why You Should Learn R Programming Language
00:00 -
Lesson 1, Practice 1: R and R Studio Explained with Practice
19:30 -
Lesson 2: Let’s Get Started with Some Installations
-
Lesson 3, Practice 2: Let Us Code Together! It is Easier than You Imagine.
16:43 -
Lesson 4, Practice 3 with Note: Syntax- Rules of Writing Code in R Programming Language
01:02:28 -
Lesson 5: Best Practice in R Syntax
-
Lesson 6, Practice 4: Assignment Solution in Syntax
15:19 -
Lesson 7, Practice 5: Data Structure in R- Vector, Matrices, List and Data Frame
01:17:16 -
Lesson 8, QUIZ 1: Syntax and Data Structure True or False Quizzes with Explanations
-
Lesson 9, Practice 6: Assignment Solution in Data Structure
32:34 -
Lesson 10, Multiple Choice Questions on Syntax and Data Structure
-
Lesson 11, Practice 7 with Note: Data Importation into R Studio
42:23 -
Lesson 12, Practice 8: Uploading files into R studio
03:53 -
Lesson 13: Work smart in R Studio Cloud
19:52 -
Lesson 14: Practice 9 with Note: Data Exploration
41:45 -
Lesson 15, Practice 10: Assignment Solution in Data Exploration
12:35 -
Lesson 16: Quiz on Data Exploration
-
Lesson 17, Practical Quiz for Application
-
Lesson 18, Practical Quiz for Application
-
Lesson 19, Practice Field 1: 37 Question & Answer Test of Knowledge
-
Lesson 20, Practice Field 2: 20 Questions of Application
-
Lesson 22, Practice Field 4: 100 Pure Application Questions & Answers
MODULE 11 OF 11: ETHICS OF DATA ANALYTICS
This course provides an extensive overview of the ethics of data analytics, covering foundational principles, practical challenges throughout the data life cycle, real-world case studies, and modern governance. It equips students to navigate complex ethical dilemmas by applying frameworks like deontology and utilitarianism. The content addresses key issues such as algorithmic bias, data privacy, and accountability. It also examines the ethical implications of data use in critical fields like healthcare and criminal justice. Finally, the course explores professional best practices and major regulations like GDPR, preparing students to develop and implement data solutions responsibly.
-
Foundational Principles of Data Ethics
-
The Data Life Cycle and Ethical Challenges
-
Ethical Applications and Case Studies
-
Regulation, Governance, and the Future
-
References
-
50 Practical Based Question and Answer with Explanation
BTICL PROJECT AND SUBMISSION GUIDELINE
-
Cover Page Guideline
-
Executive Summary
-
Guideline for Introduction
-
Methodology
-
Result and Analysis
-
Conclusion and Recommendation
-
References
-
Appendix Page
-
Formatting Guide
-
Plagiarism Policy
-
Grading Rubric (100 Marks)
-
Submission Instructions
PROJECT TOPIC
-
FINAL PROJECT QUESTION
-
FINAL PROJECT- SUBMIT HERE
Student Ratings & Reviews
No Review Yet