This project is ongoing and subject to continuous advancements and modifications.
This project focuses on estimating photometric redshifts, which are crucial for studying the large-scale structure of the universe and the distribution of galaxies. It employs Gaussian processes as a flexible non-parametric approach to effectively model uncertainties in photometric data. The project also integrates various data analysis techniques to enhance accuracy and performance, offering a comprehensive framework for photometric redshift estimation and other ML and AI methods, benchmarking between them to observe each method's performance in terms of accuracy and computational time.
The project aims to test existing methodologies, such as Gaussian processes, to calculate photometric redshifts and mass estimates on a dataset with known redshifts, like Stripe 82X, to validate and benchmark the approach. The results will then be reproduced using the older dataset to ensure consistency and accuracy, demonstrating alignment with published data. Once validated, the methodology will be adapted and applied to other wide X-ray fields with incomplete redshift data, such as XMM XXL, while addressing challenges posed by inhomogeneous data coverage. The performance of the approach will be evaluated across datasets with varying depths and completeness to optimize its reliability for diverse datasets. Finally, the methodology will be scaled for fields with no redshifts, enabling broader application in X-ray AGN studies and mass estimation while leveraging advanced machine learning techniques.
- Gaussian Process Regression: Leverage Gaussian processes to estimate redshifts, allowing for a quantifiable measure of uncertainties.
- Data Handling and Preprocessing: Tools for cleaning and preparing synthetic datasets based on the Sloan Digital Sky Survey (SDSS).
- Advanced Data Analysis: Combines Gaussian processes with other statistical and machine learning techniques to enhance predictive power.
- Visualization Tools: Includes tools for visualizing redshift distributions, error margins, and overall model performance.
- Thorough Documentation: Detailed explanations and example notebooks for easy understanding and reproducibility.
- Python Version: Python 3.8+
- Key Packages:
- Core Scientific Libraries: NumPy, Pandas, SciPy
- Machine Learning and Statistical Modeling: Scikit-Learn, GPflow
- Visualization Tools: Matplotlib, Seaborn
- Astronomy-Specific Tools: Astroquery, Astropy
- Deep Learning (Optional): TensorFlow
- Utilities: tqdm, h5py
- Clone the Repository
git clone https://github.com/Adrita-Khan/AstroPhotoZ.git
- Install Dependencies
pip install -r requirements.txt
- Run Example Notebooks
To help you get started with the project, you can run the following example notebooks:
Photometric_Redshift_Dataset_Exploration.ipynb
: An exploratory analysis of the photometric redshift dataset to understand underlying patterns and features.Synthetic_Photometric_Redshift_Predictor.ipynb
: A step-by-step guide to predicting photometric redshifts using synthetic data.Sample_Galaxy_Redshift_Prediction_py.ipynb
: An example notebook for predicting galaxy redshifts using real data.
Follow the notebooks to apply Gaussian processes (notebooks and scripts will be updated and shared as the work progresses) and other data analysis techniques to photometric data. Hyperparameters and methods can be adjusted to suit specific research requirements.
Contributions are welcome! Please feel free to open issues, suggest improvements, or submit pull requests.
This project is licensed under the MIT License.
- Scikit-Learn Astronomy Regression Tutorial
- Photo-z Regression Demo - Mofokeng Chaka
- Multi-Wavelength Classification and Regression
- PhotoZ_SDSS by Tasos Theodoropoulos
- TITAN Project - PhotoZ SDSS ML
- Photometric Redshifts - Martian Side of the Moon
- AstroML - Forest Photometric Redshift Estimation
- Photometric Redshift Estimation - Amber
- Photometric Redshift Estimation by Qbeer
- MLZ: Machine Learning Redshifts
- SDSS4 DR16Q Tutorial by Qiaoya Wu
- SDSS DR8 Data Access
- SDSS DR9 Photo-z Algorithms
- SDSS DR17 Photo-z Algorithms
- sdss Python Package
- Astroquery SDSS Documentation
- Astroquery SDSS API
- SDSS DR14 SkyServer SQL Search
- SDSS DR18 SkyServer SQL Search
- SDSS DR17 Algorithms - Photo-z: A comprehensive guide to the photometric redshift algorithms used in SDSS DR17.
- The Dark Energy Survey Data Management System: Explore figures and insights from the Dark Energy Survey, focusing on data management and redshift estimation techniques.
- Understanding Redshift - Sky at Night Magazine: A detailed article explaining the concept of redshift, its significance in astronomy, and its applications in understanding the universe.
- Hubble Law Introduction: An introductory resource on Hubble's Law and its implications in cosmology.
For any inquiries or feedback, please contact:
Adrita Khan