Welcome to My GitHub Portfolio.
I am a scientific professional with a PhD in Cancer Research, 2 years of postdoctoral research specialising in CRISPR gene editing, and 3 years of oncology drug discovery experience at a world-leading CRO. As I transition from the wet lab to the field of data science, this portfolio represents my journey and development across various domains, including bioinformatics, computational biology, and machine learning.
With a focus on real-world applications within the life sciences, particularly in drug discovery, my work here showcases projects that blend scientific insight with data-driven techniques. Each project reflects my commitment to leveraging computational methods to address key biological questions and drive advancements in therapeutics.
- R Programming: Bioconductor, Tidyverse
- Python Programming: Numpy, Pandas, Matplotlib, Seaborn
- Machine Learning: Regression and classification (scikit-learn); deep learning and neural networks (TensorFlow, Keras)
- Version Control: Git
- Data Science Environments: Jupyter, Anaconda, Command Line
- ELN/Compound Management: Dotmatics, Vortex, ChemInventory
- Cloud Platforms: Benchling, Egnyte
- SQL: Relational database management and processing
- Computational Biology: Comprehensive analysis of RNASeq data using Bioconductor packages such as DESeq2 and clusterProfiler.
- Machine Learning: Development of predictive models for drug discovery using various publicly available datasets and extracting information on bioactivity of drug-like compounds, gene expression data, mutations, etc.
- Data Batch Processing and Automation: Building automated methods for batch processing multiple datasets using Python and Command Line.
π€ Please follow my GitHub profile for learning, teaching, knowledge sharing, and collaborating! I look forward to connecting with fellow enthusiasts and professionals in the field!
βοΈ Email [email protected] π LinkedIn | GitHub
Thank you for visiting, and I hope you find these projects insightful.