Data Science and Computational Approaches
Find information about Computational Approaches in Biology and Data Science employing R, Python, Matlab, including computational approaches for the following:
to implement big data analysis in biological sciences (computational biology),
to monitor students’ learning and performance in the classroom,
to implement machine learning methods.
Example of Proteomic Analysis of Ovarian Cancer: This example proteomic data analysis, performed in the Matlab computer language, analyzed SELDI-TOF mass spectrometry from blood samples of two groups: ovarian cancer vs. control group. In this example, Matlab scripts present the pre-process of the data, generate plots of the groups' spectrograms, and rank features using a two-way t-statistic. The data analyzed in this example is from the FDA-NCI Clinical Proteomics Program.
Differential in Gene Expression between Fetal and Adult Brains: In this genomic data science project, the goal was to evaluate differences in gene expression between fetal and adult brains. Transcriptome sequencing data (known as RNA-seq) for this genomic data science project was sequenced on an Illumina platform from human post-mortem brains. The raw sequencing data and meta-data related to each brain sample were retrieved from a public database. The data was aligned, quality control was performed on the alignments, and gene count levels of expression were quantified. With the aid of exploratory and fitting statistical models, patterns of gene expression were identified between fetus and adult brains. This genomic data science project provided insights into genes differentially expressed between fetus and adult brains that help understand human development and represents an example of reproducible research in genomic sciences.
Applying Data Science Approaches in Biological Sciences Classrooms: Evidence-based decisions in the classroom are essential for us as professors to implement adjustments in students' learning, including in biological sciences courses. In this project, I'll share data science approaches, projects, and peer-review publications that could benefit in implementing pedagogical interventions in your classrooms.
Recently-Published Reports
Correct/Incorrect Ratio in Staining Techniques in Microbiology Lab
Data Analysis in a General Education Biology Course after the First Summative Assessment
Text Mining R Code to Determine Most Frequent Words among Students’ Hypotheses
An R function to generate proportions and logistic regression for of correct answers
Shinny app to predict users' next word: This project describes the pitch for a shiny app elaborated to predict next-words followings user's words input.
Other biological and non-biological data sciences