Skip to Content
Course

Capstone in Applied Data Science with R

CSE-41410

Showcase your R expertise with a real-world, AI-powered analytics project

This R Analytics Capstone Project course integrates skills from the three required courses in the "R for Data Analytics" program, enabling students to create a professional portfolio project (e.g. Quarto book, R-Shiny interactive visualization, or similar) using AI assistance. Focusing on data collection, cleaning, analysis, and visualization from provided sources, this course prepares professionals, researchers, and students for real-world analytics challenges.   

Course Highlights:

  • Introduction to Quarto Book Creation: Learning to structure and format a Quarto book for project documentation, including text, code, and output integration.
  • Data Collection from Multiple Sources: Selecting and importing data from provided datasets, preparing for analysis
  • Data Cleaning and Preparation: Techniques to handle missing values, inconsistencies, and transformations in R
  • Statistical Analysis Selection: Choosing and implementing a statistical method (e.g., regression, t-test) based on project needs
  • Table and Figure Generation: Creating tables with "flextable" and visualizations to present results clearly
  • Programming Skills: Writing custom functions and loops to automate and optimize data tasks
  • Project Summary and Reporting: Crafting a comprehensive report in Quarto, integrating analysis, visuals, and conclusions
  • AI-Assisted Project Development: Utilizing AI tools (e.g., for code suggestions, report polishing) to enhance project quality

Course Learning Outcomes:

  • Synthesize skills from prior R courses to design and execute a comprehensive data analytics project, documented in a professional R project
  • Apply data collection, cleaning, and statistical analysis techniques to real-world datasets, selecting from provided sources
  • Create publication-ready tables and figures to communicate findings effectively
  • Develop reusable R functions and loops to streamline data processing, enhancing project efficiency
  • Produce a detailed summary report, leveraging AI tools to refine analysis and presentation for professional use

Course Typically Offered: Online in spring and fall quarters

Prerequisites: Students must complete all three required courses in the R for Data Analytics certificate in order to enroll into this course which includes CSE-41097 Introduction to R Programming, CSE-41198 Introduction to Statistics using R, and CSE-41408 Advanced Data Wrangling and Visualization in R.

Next Step: After completing this course, consider taking CSE-41396 Practical R for the Pharmaceutical Industry to continue learning. 

Contact: For more information about this course, please email unex-techdata@ucsd/edu.




 

Course Information

2.00 units
TBD

Course sessions

Please contact the Science & Technology department at 858-534-3229 or unex-sciencetech@ucsd.edu for information about this course and upcoming sections.