


Processing Actionable Data in Genomics
BIOL-40327
Experience bioinformatics approaches to processing genomic data and distilling consequential biomedical information. Genomic sequencing technologies deliver massive volumes of data that require dedicated handling methods and diverse abilities for troubleshooting. This hands-on course focuses on UNIX-based analysis tasks commonly employed in the field. Instruction covers general considerations ranging from experiment configuration, data QC, and software systems, to tuning of algorithms and visualization of results. In addition, assigned work with public datasets will provide students with weekly hands-on experience with several widely used techniques, including read mapping, variant calling, annotation, and pipeline construction. Class sessions consist of video lectures covering slides, some assigned reading, quizzes, and directed forum participation. Mandatory data processing homework is a significant part of this class.
By the end of the course, you will be able to:
- Compare the utility of different types of sequencing experiments
- Structure bioinformatic pipelines to fit the experiment
- Implement steps in bioinformatic processing
- Critique genomic processing results
For more information on this course, please contact: appliedscience@ucsd.edu
Course Information
Course sessions
Section ID:
Class type:
This course is entirely web-based and to be completed asynchronously between the published course start and end dates. Synchronous attendance is NOT required.
You will have access to your online course on the published start date OR 1 business day after your enrollment is confirmed if you enroll on or after the published start date.
Textbooks:
No textbook required.
Policies:
- No refunds after: 10/6/2025
Schedule:
Instructor:
Fatemeh Zare

Dr. Fatima Zare is a seasoned bioinformatics data scientist with over 8 years of experience, currently serving as the Lead Data Scientist at bioMérieux. She brings a robust background in mathematics and a passion for data, excelling in data cleaning, modeling, and uncovering actionable insights.
Specializing in tools like Python and R, Dr. Zare leverages libraries such as Pandas, NumPy, TensorFlow, and Scikit-Learn to address complex machine learning challenges, including regression, clustering, natural language processing (NLP), and time series analysis. She is proficient in platforms like Jupyter Notebook, Google Colab, and PyCharm, and ensures efficient project management using tools like Jira and Git.
Dr. Zare holds a Ph.D. in Computer Science & Engineering from the University of Connecticut and has conducted advanced research as a Postdoctoral Fellow at Harvard University. With a deep commitment to education, she combines her analytical expertise, leadership, and communication skills to empower aspiring data scientists to excel in the field.
Section ID:
Class type:
This course is entirely web-based and to be completed asynchronously between the published course start and end dates. Synchronous attendance is NOT required.
You will have access to your online course on the published start date OR 1 business day after your enrollment is confirmed if you enroll on or after the published start date.
Textbooks:
All course materials are included unless otherwise stated.
Policies:
- No refunds after: 1/19/2026
Schedule:
Instructor:
Genaro Pimienta

After obtaining a PhD degree in structural biology (EMBL-Heidelberg, Germany), Genaro pursued post-doctoral training in proteomics and transcriptomics (Hopkins University and Yale University). Heralding the value of computational biology, Dr. Pimienta has, for the past 20 years, explored various ways of combining proteomics, transcriptomics and genomics data, to answer biomedical questions.
In his spare time, Genaro finds piece spending time with his family, gardening, or reading and writing fiction. A most recent joy of his, is to (pen in hand) spot the predictions of a world welded by artificial intelligence technologies, made by science fiction writers of last century, like Isaac Asimov, Phillip K. Dick, and William Gibson.
Section ID:
Class type:
This course is entirely web-based and to be completed asynchronously between the published course start and end dates. Synchronous attendance is NOT required.
You will have access to your online course on the published start date OR 1 business day after your enrollment is confirmed if you enroll on or after the published start date.
Textbooks:
All course materials are included unless otherwise stated.
Policies:
- No refunds after: 3/30/2026
Schedule:
Instructor: Tristan Carland
Tristan Carland is a manager of business systems in the genomics group at Q Squared Solutions of IQVIA, a leading global clinical trial laboratory services organization. His duties include the management of several business analysts and software product owners, as well as contributing to the design and implementation of process and software systems supporting the genomics group, ranging from bioinformatic solutions for new genomic assays to compliance related quality management. Before this he was a senior bioinformaticist with the same group managing individual tools and the scientific research and development of new genomic assays.
Previously, Carland was a software engineer with Illumina, the leading producer of DNA/RNA sequencing instruments, a researcher with The Scripps Research Institute and the J. Craig Venter Institute, and a graduate researcher at the Scripps Institution of Oceanography at UC San Diego. He is also a trained educator in inquiry based learning methods, invited lecturer for graduate and undergraduate bioinformatics courses, and previously awarded teaching assistant by student poll.
Carland completed his Ph.D. and received a master's in marine biology from UC San Diego after double majoring in marine biology and computer science at the University of North Carolina at Wilmington.