A project for estimating photometric redshifts using Gaussian processes and more. This repository includes a pipeline for data preprocessing, model training, and evaluation, focusing on improving accuracy in redshift prediction to enhance large-scale cosmic structure studies.

Adrita-Khan/AstroPhotoZ

Photometric Redshifts Estimation

This project is ongoing and subject to continuous advancements and modifications.

Project Overview

This project focuses on estimating photometric redshifts, which are crucial for studying the large-scale structure of the universe and the distribution of galaxies. It uses Gaussian processes as a flexible, non-parametric approach to modeling uncertainties in photometric data. The project also integrates other machine learning and data analysis techniques, and benchmarks the methods against one another in terms of accuracy and computational time.
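As a rough illustration of benchmarking methods on accuracy and computational time, the sketch below times the fit and predict steps of three scikit-learn regressors on synthetic data. The choice of comparison models (random forest, k-nearest neighbours), the synthetic features, and the `results` layout are assumptions made for this example and are not taken from the project's notebooks.

```python
import time

import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel
from sklearn.neighbors import KNeighborsRegressor

# Hypothetical synthetic data: four colour-like features and a smooth redshift relation.
rng = np.random.default_rng(42)
X = rng.normal(size=(400, 4))
z = 0.5 + 0.2 * X[:, 0] - 0.1 * X[:, 1] + rng.normal(scale=0.02, size=400)
X_train, X_test = X[:300], X[300:]
z_train, z_test = z[:300], z[300:]

# Assumed set of methods to compare; swap in whichever models are of interest.
models = {
    "gaussian_process": GaussianProcessRegressor(
        kernel=RBF() + WhiteKernel(), normalize_y=True, random_state=0
    ),
    "random_forest": RandomForestRegressor(n_estimators=100, random_state=0),
    "k_neighbours": KNeighborsRegressor(n_neighbors=10),
}

results = {}
for name, model in models.items():
    # Time training and prediction separately.
    start = time.perf_counter()
    model.fit(X_train, z_train)
    fit_seconds = time.perf_counter() - start

    start = time.perf_counter()
    z_pred = model.predict(X_test)
    predict_seconds = time.perf_counter() - start

    rmse = float(np.sqrt(np.mean((z_pred - z_test) ** 2)))
    results[name] = {"rmse": rmse, "fit_s": fit_seconds, "predict_s": predict_seconds}

for name, row in results.items():
    print(f"{name:18s} rmse={row['rmse']:.4f} "
          f"fit={row['fit_s']:.3f}s predict={row['predict_s']:.4f}s")
```

Timings from a single run are noisy, so repeating each measurement and reporting a median would give a fairer comparison.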

Figure: Galaxy Redshift

Aim

The project aims to test existing methodologies, such as Gaussian processes, for calculating photometric redshifts and mass estimates on a dataset with known redshifts, such as Stripe 82X, in order to validate and benchmark the approach. The results will then be reproduced using the older dataset to confirm that they are consistent with published data. Once validated, the methodology will be adapted and applied to other wide X-ray fields with incomplete redshift data, such as XMM XXL, while addressing the challenges posed by inhomogeneous data coverage. Performance will be evaluated across datasets of varying depth and completeness to make the approach reliable on diverse data. Finally, the methodology will be scaled to fields with no redshifts, enabling broader application in X-ray AGN studies and mass estimation using machine learning techniques.
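Validation against known redshifts is usually summarised with a few standard statistics. The sketch below computes the median bias, the normalised median absolute deviation, and the outlier fraction of the normalised residual (z_phot - z_true) / (1 + z_true). The function name `photoz_metrics`, the 0.15 outlier threshold (a common convention, assumed here), and the toy arrays are illustrative and are not project results.

```python
import numpy as np


def photoz_metrics(z_phot, z_true, outlier_threshold=0.15):
    """Summarise photo-z quality with three commonly used statistics."""
    z_phot = np.asarray(z_phot, dtype=float)
    z_true = np.asarray(z_true, dtype=float)

    # Normalised residual: the redshift error scaled by (1 + z_true).
    dz = (z_phot - z_true) / (1.0 + z_true)

    # Median offset of the residuals.
    bias = np.median(dz)
    # Normalised median absolute deviation, a scatter estimate robust to outliers.
    sigma_nmad = 1.4826 * np.median(np.abs(dz - bias))
    # Fraction of objects whose residual exceeds the threshold.
    outlier_fraction = np.mean(np.abs(dz) > outlier_threshold)

    return {
        "bias": float(bias),
        "sigma_nmad": float(sigma_nmad),
        "outlier_fraction": float(outlier_fraction),
    }


# Toy values for illustration only.
z_true = np.array([0.10, 0.50, 1.00, 1.50, 2.00])
z_phot = np.array([0.12, 0.48, 1.05, 1.45, 3.00])
metrics = photoz_metrics(z_phot, z_true)
print(metrics)
```

In this toy sample only the last object exceeds the threshold, so the outlier fraction is one in five.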

Features

  • Gaussian Process Regression: Leverage Gaussian processes to estimate redshifts, allowing for a quantifiable measure of uncertainties.
  • Data Handling and Preprocessing: Tools for cleaning and preparing synthetic datasets based on the Sloan Digital Sky Survey (SDSS).
  • Advanced Data Analysis: Combines Gaussian processes with other statistical and machine learning techniques to enhance predictive power.
  • Visualization Tools: Includes tools for visualizing redshift distributions, error margins, and overall model performance.
  • Thorough Documentation: Detailed explanations and example notebooks for easy understanding and reproducibility.
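To make the Gaussian process feature concrete, here is a minimal sketch using scikit-learn's `GaussianProcessRegressor`, which returns a predictive standard deviation alongside each redshift estimate. The synthetic features, the kernel choice (RBF plus white noise), and all variable names are assumptions for illustration; the project's own notebooks may use GPflow or a different setup.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

rng = np.random.default_rng(0)

# Hypothetical synthetic data: four colour-like features per galaxy and a
# redshift that depends smoothly on them, plus a little noise.
n_train, n_test = 200, 50
X = rng.normal(size=(n_train + n_test, 4))
z = 0.5 + 0.2 * X[:, 0] - 0.1 * X[:, 1] + 0.05 * np.sin(X[:, 2])
z += rng.normal(scale=0.02, size=z.shape)

X_train, X_test = X[:n_train], X[n_train:]
z_train, z_test = z[:n_train], z[n_train:]

# RBF with one length scale per feature, plus a white-noise term for scatter.
kernel = RBF(length_scale=np.ones(4)) + WhiteKernel(noise_level=1e-2)
gp = GaussianProcessRegressor(kernel=kernel, normalize_y=True, random_state=0)
gp.fit(X_train, z_train)

# return_std=True yields a per-object predictive standard deviation.
z_pred, z_std = gp.predict(X_test, return_std=True)
rmse = float(np.sqrt(np.mean((z_pred - z_test) ** 2)))

print("fitted kernel:", gp.kernel_)
print("RMSE:", rmse)
print("mean predictive std:", float(z_std.mean()))
```

Exact Gaussian process training scales cubically with the number of training objects, so large catalogues typically need subsampling or a sparse approximation.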

Figure: Photometric Redshift vs True Redshift

Figure: Hubble Law Animation

Requirements

  • Python Version: Python 3.8+
  • Key Packages:
    • Core Scientific Libraries: NumPy, Pandas, SciPy
    • Machine Learning and Statistical Modeling: Scikit-Learn, GPflow
    • Visualization Tools: Matplotlib, Seaborn
    • Astronomy-Specific Tools: Astroquery, Astropy
    • Deep Learning (Optional): TensorFlow
    • Utilities: tqdm, h5py
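A quick way to check an environment against the list above is to look up each package's import name without importing it. This is a small sketch: the import names (for example `sklearn` for Scikit-Learn) are the usual ones, and the helper `missing_modules` is a hypothetical name introduced here.

```python
import sys
from importlib.util import find_spec

# Import names for the packages listed above (import names can differ from
# the names used on PyPI, e.g. scikit-learn is imported as sklearn).
REQUIRED = [
    "numpy", "pandas", "scipy", "sklearn", "gpflow",
    "matplotlib", "seaborn", "astroquery", "astropy", "tqdm", "h5py",
]
OPTIONAL = ["tensorflow"]


def missing_modules(names):
    """Return the subset of top-level module names that cannot be found."""
    return [name for name in names if find_spec(name) is None]


python_ok = sys.version_info >= (3, 8)
missing_required = missing_modules(REQUIRED)
missing_optional = missing_modules(OPTIONAL)

print("Python 3.8+:", python_ok)
print("Missing required packages:", missing_required or "none")
print("Missing optional packages:", missing_optional or "none")
```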

Getting Started

  1. Clone the Repository
    git clone https://github.com/Adrita-Khan/AstroPhotoZ.git
  2. Install Dependencies
    cd AstroPhotoZ
    pip install -r requirements.txt
  3. Run Example Notebooks

To help you get started with the project, you can run the following example notebooks:

  1. Photometric_Redshift_Dataset_Exploration.ipynb: An exploratory analysis of the photometric redshift dataset to understand underlying patterns and features.
  2. Synthetic_Photometric_Redshift_Predictor.ipynb: A step-by-step guide to predicting photometric redshifts using synthetic data.
  3. Sample_Galaxy_Redshift_Prediction_py.ipynb: An example notebook for predicting galaxy redshifts using real data.

Usage

Follow the notebooks to apply Gaussian processes and other data analysis techniques to photometric data. Notebooks and scripts will be updated and shared as the work progresses. Hyperparameters and methods can be adjusted to suit specific research requirements.
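One way to adjust the method is to try several kernels and compare their fitted log marginal likelihoods. The sketch below does this with scikit-learn on synthetic data. The candidate kernels, the number of optimizer restarts, and the data are assumptions for illustration and do not reflect settings used in the project's notebooks.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import (
    RBF,
    Matern,
    RationalQuadratic,
    WhiteKernel,
)

# Hypothetical synthetic data standing in for photometric features.
rng = np.random.default_rng(1)
X = rng.uniform(-2, 2, size=(150, 3))
z = 0.6 + 0.3 * np.tanh(X[:, 0]) + 0.1 * X[:, 1]
z += rng.normal(scale=0.03, size=150)

# Assumed candidate kernels; each is combined with a white-noise term below.
candidates = {
    "rbf": RBF(length_scale=1.0),
    "matern_2.5": Matern(length_scale=1.0, nu=2.5),
    "rational_quadratic": RationalQuadratic(length_scale=1.0, alpha=1.0),
}

scores = {}
for name, base_kernel in candidates.items():
    gp = GaussianProcessRegressor(
        kernel=base_kernel + WhiteKernel(noise_level=0.1),
        normalize_y=True,
        n_restarts_optimizer=2,  # extra optimizer starts to reduce the risk of poor local optima
        random_state=0,
    )
    gp.fit(X, z)
    # Log marginal likelihood at the optimised hyperparameters (higher is better).
    scores[name] = float(gp.log_marginal_likelihood_value_)

best = max(scores, key=scores.get)
for name, score in scores.items():
    print(f"{name:20s} log marginal likelihood = {score:.2f}")
print("best kernel by this criterion:", best)
```

The marginal likelihood is only one selection criterion; checking the chosen kernel on held-out objects with known redshifts is a useful complement.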

Contributing

Contributions are welcome! Please feel free to open issues, suggest improvements, or submit pull requests.

License

This project is licensed under the MIT License.

Contact

For any inquiries or feedback, please contact:

Adrita Khan

📧 Email | 🔗 LinkedIn | 🐦 Twitter
