OS-Climate - Establish minimum versions of tools / packages for Dev Cluster #234

MichaelTiemannOSC · 2022-11-16T18:44:41Z

xref #98

High-level question: if we use conda as our base installation system, users can install shell-level packages such as ghostscript without needing to beg for special install help. With pip/pipenv, we are entirely limited to Python. What, really, is the best choice here?

HeatherAck · 2022-11-28T18:35:49Z

remove highlander. Categorize and group, note installed version. Create a standard config for notebooks (default for users)

Architecture diagram of interdependencies/layers: call out dependencies and inheritance, @redmikhail to share starting point from Humair
- [ ] Phase 1: Open Metadata, dbt
Python-related
Individual packages not under ODH: e.g. fybrik
ODH-related: (go back to default operator config - @redmikhail to check with @HumairAK to see what problems will occur if we leverage default)
- Need to determine validation methodology and who will perform UAT (e.g. Trino connectors)
- Test cases for each upgrade (e.g. smoke test demo with expected results) (Heather to create new issues for each workstream)
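
The inventory step noted above (categorize and group packages, note the installed version) could be sketched in Python roughly as follows. This is only an illustration under stated assumptions: the category names loosely mirror the groups in this comment, but the package names and minimum versions in `MINIMUMS` are hypothetical placeholders, not agreed values, and `parse_version`, `installed_version`, and `report` are made-up helper names. Only the standard library (`importlib.metadata`, Python 3.8+) is used.

```python
from importlib import metadata


def parse_version(text):
    """Turn '0.13.1.3' into (0, 13, 1, 3); stops at the first non-numeric part."""
    parts = []
    for piece in text.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Hypothetical groups and minimum versions, for illustration only.
MINIMUMS = {
    "Python-related": {"pip": "20.0"},
    "Not under ODH": {"some-missing-package": "1.0"},
}


def report(minimums, lookup=installed_version):
    """Build rows of (category, package, installed, minimum, status)."""
    rows = []
    for category, packages in minimums.items():
        for package, minimum in packages.items():
            found = lookup(package)
            if found is None:
                status = "missing"
            elif parse_version(found) >= parse_version(minimum):
                status = "ok"
            else:
                status = "too old"
            rows.append((category, package, found, minimum, status))
    return rows


if __name__ == "__main__":
    for row in report(MINIMUMS):
        print(row)
```

The `lookup` parameter exists so the report can be tested against a fixed dictionary instead of the live environment. The numeric-tuple comparison is deliberately simple; for pre-release or local version tags, `packaging.version.Version` would be the more robust choice.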

HeatherAck · 2022-12-05T18:13:45Z

Main reasons for separation: capacity concerns and limited functionality; how ODH treats and updates subcomponents needed updating. Need to understand where ODH is going. Treat Trino and Superset separately, as this is also being done within the ODH community.

@redmikhail to consolidate approvers - create separate team to perform PR merge; regain control

HeatherAck · 2022-12-05T18:26:08Z

Add LF team members to maintainer status: Apps, Support (https://github.com/operate-first/support) plus others (@redmikhail to add others and provide PR): 6-Dec
Consider a separate branch of Operate First for LF OS-Climate? To be discussed in upcoming meeting with Marcel and team.

HeatherAck · 2022-12-19T18:53:02Z

@redmikhail to update list this week

Meeting held with Marcel. Operate First is shrinking; consider use of a managed service from Red Hat / AWS (OpenShift specifically - SREs). The stable cluster is a better candidate for that, but not all services are covered under the managed service (e.g. GPU usage split).

In the January meeting:

- Need to meet and discuss pros/cons of moving to ODH as the primary implementation (so we can get the latest Jupyter release) with overlays for specific packages (e.g. Open Metadata) versus Operate First (where certain releases lag behind the latest ones).
- Need a plan for OpenShift operational support.
- Need to understand any custom configurations required by OS-C.
- Need to understand the definition of a stable cluster (what is the support model for applications as well as the cluster itself).

Complexity of dealing with a platform (OS-Climate) on top of a platform (ODH) on top of a platform (Operate First)

How do users create their own Data Commons? What is the overall goal: keeping data private as well as accessing data via data exchange? (See also "Data Commons implementation requirements within own local environment" #245.)

@HeatherAck to schedule meeting week of 9-Jan to align on pros/cons and discuss path forward.

MichaelTiemannOSC · 2022-12-19T19:41:25Z

As an open source software project, OS-Climate provides the raw materials for users to contribute to and/or fork project elements as they see fit. If users have their own ideas about what it means to run the Data Commons within their own local environment, it should be those users doing the legwork of what that actually means, and committing the resources necessary to push patches they want to see into the upstream source code (which OS-Climate should review and potentially accept). But I don't think the OS-Climate project should try too hard to imagine and prototype those use cases itself. Rather it should help guide users to do that work for themselves.

HeatherAck · 2023-01-09T18:39:40Z

Need to determine pace and sizing / priorities for each element on 10-Jan, consistent developer process/implementation

MichaelTiemannOSC · 2023-01-10T18:11:54Z

I've been updating version numbers, but I want to call attention to Open Metadata release 0.13.1.3 (Jan 9th), which provides important fixes vs. 0.13.1.

HeatherAck · 2023-01-30T18:56:10Z

@redmikhail @ryanaslett @MightyNerdEric @erikerlandson to focus on upgrading system level software (see ODH sub packages above: e.g. Trino, Jupyter, Python)

HeatherAck · 2023-01-30T19:08:39Z

@caldeirav - do we need to use Elyra pipeline features?
@erikerlandson / @redmikhail - to define the standard developer notebook software configs needed (libraries). Heather to schedule a meeting with @erikerlandson. Guillaume Moutier may be able to provide support.

MichaelTiemannOSC · 2023-01-31T11:00:06Z

The OpenMetadata team has been busily walking their versions forward. 0.13.2.1 was just released (see https://github.com/open-metadata/OpenMetadata/releases for info about 0.13.2).
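
To make the ordering of the Open Metadata releases mentioned in this thread explicit, here is a small sketch that sorts the three version strings. `version_key` is a hypothetical helper name, and it assumes purely numeric dotted versions (it would raise on a tag such as `1.0rc1`).

```python
def version_key(text):
    """Convert a dotted numeric version such as '0.13.2.1' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


# Release strings taken from the comments in this issue.
releases = ["0.13.2.1", "0.13.1", "0.13.1.3"]

# Sort numerically, component by component, oldest first.
ordered = sorted(releases, key=version_key)
latest = ordered[-1]

print(ordered)
print(latest)
```

Comparing integer tuples rather than raw strings matters once components reach two digits: as plain strings, `"0.9"` would sort after `"0.13"`.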

HeatherAck · 2023-02-06T18:58:02Z

@ryanaslett to start investigating latest ODH version compared to installed. Prep work - manifest storage, look at operate first manifest. (week of 6-Feb)

HeatherAck · 2023-02-13T18:44:25Z

@ryanaslett - trying to define what ODH contributions OS-C is going to make, but still want to move forward with the core component upgrade. No easy way to upgrade. Need to install from scratch and migrate functionality (e.g., notebooks) and access/authentication. May require different packages, such as Superset. Start with CL1: eliminate old ODH and install the new one. Get feedback on the new version / verify it is functioning as expected.

Open question: @caldeirav, will ODH be a stable version that we will use long term? Please confirm that the Red Hat team will contribute to ODH going forward.

HeatherAck · 2023-02-13T18:57:31Z

Note: fork it to OS-Climate (not Operate First). Keep as a separate repo. See if ODH supports SQLAlchemy 2.0.

HeatherAck · 2023-02-27T19:01:38Z

@ryanaslett reviewed Superset: no dependencies, only APIs. Will review the ODH components and figure out the core offering as part of ODH. Recommended next steps: (1) Install new core ODH on CL1 with tier 0 components (MUST HAVE). (2) Bring over JupyterHub images (verify that authentication works). (3) Bring over other components after review (Trino, Open Metadata).

MichaelTiemannOSC added the cluster/osc-cl2 label Nov 16, 2022

MichaelTiemannOSC added this to Data Commons Platform Nov 16, 2022

HeatherAck mentioned this issue Nov 17, 2022

OS-Climate - Establish the base software versions and tools for Stable instance cluster #98

Closed

HeatherAck assigned eb-oss Nov 28, 2022

HeatherAck assigned redmikhail and HeatherAck Nov 28, 2022

HeatherAck moved this to In Progress in Data Commons Platform Dec 19, 2022

This was referenced Jan 22, 2023

Template for Data Pipelines #253

Closed

Onboard ESSD dataset using Open Metadata #183

Open

HeatherAck changed the title ~~OS-Climate Dev instance cluster~~ OS-Climate - Establish minimum versions of tools / packages for Dev Cluster Jan 30, 2023

HeatherAck assigned ryanaslett Feb 7, 2023

redmikhail mentioned this issue Mar 10, 2023

Install latest version of ODH operator without Trino #252

Open

eb-oss mentioned this issue Jul 26, 2023

Upgrade Trino on CL1 and CL2 to latest version #323

Closed

HeatherAck mentioned this issue Aug 14, 2023

Boto3 needs Python >= 3.8 from december on os-climate/corporate_data_extraction#20

Closed

HeatherAck removed this from Data Commons Platform Sep 18, 2023

HeatherAck added this to Data Commons - Q4 2023 Sep 18, 2023

HeatherAck moved this to In Progress in Data Commons - Q4 2023 Sep 18, 2023

eharrison24 moved this from In Progress to Backlog in Data Commons - Q4 2023 Nov 17, 2023
