Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add mapping between SPDX AI and Dataset Profiles and MLCommons Croissant #962

Open
bact opened this issue Jan 23, 2025 · 0 comments
Open
Labels
Profile:AI Artificial Intelligence Profile and related matters Profile:Dataset Dataset Profile and related matters
Milestone

Comments

@bact
Copy link
Collaborator

bact commented Jan 23, 2025

From 2025-01-22 AI meeting, we may like to have a mapping between SPDX Dataset fields and MLCommons Croissant for interop with major dataset platforms.

Croissant metadata format provides a vocabulary for dataset attributes, streamlining how data is loaded across ML frameworks such as PyTorch, TensorFlow or JAX. https://github.com/mlcommons/croissant

Croissant Format Specification 1.0
https://docs.mlcommons.org/croissant/docs/croissant-spec.html

@bact bact added Profile:AI Artificial Intelligence Profile and related matters Profile:Dataset Dataset Profile and related matters labels Jan 23, 2025
@bact bact added this to the 3.1 milestone Jan 23, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Profile:AI Artificial Intelligence Profile and related matters Profile:Dataset Dataset Profile and related matters
Projects
None yet
Development

No branches or pull requests

1 participant