MariaDB to PostgreSQL data migration #60
Comments
Some small questions @amyfromandi There seem to be more rows in
Also, one other comment: the v2 API here is meant to stay stable as v2 and not evolve into v3, so ensuring parity of responses between MariaDB and PostgreSQL is critical.
We have to look into the following tables to understand how they were migrated into PostgreSQL
The following tables are not important
The following tables are not found in MariaDB but are important to preserve in PostgreSQL. We should copy their contents out of the
The tables should be "mirrored" into the
Here are the latest variances after MariaDB was restored into the dev environment.
This issue covers the current progress with the data migration initiative and outlines detailed next steps. Our goal is to eventually migrate all of the data in MariaDB into a PostgreSQL production database, ensuring the entire Macrostrat infrastructure operates off a single server.
Current milestones
We are updating a v2 API clone (possibly to become v3) by redirecting all endpoints from the MariaDB database to the PostgreSQL macrostrat_two clone. The purpose of this step is to confirm that a fully functioning API can run on PostgreSQL data alone. This step is nearly complete: only 7 endpoints remain to be redirected.
Next steps
After all endpoints are repointed, we need to do in-depth testing to ensure that this API functions identically to production. So far, we can say that the endpoints execute successfully when repointed to the PostgreSQL macrostrat_two clone. However, we still need to verify that the data is returned in the same format as production.
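One way the format check described above could be approached is a structural diff of the JSON returned by production and by the repointed clone. The following is only a minimal sketch: `diff_shape` is a hypothetical helper (not part of the existing codebase), and the sample payloads and field names are made up for illustration. It compares keys, value types, and list lengths, not the values themselves.

```python
from typing import Any, List


def diff_shape(expected: Any, actual: Any, path: str = "$") -> List[str]:
    """Return human-readable differences in structure (keys, types, list lengths).

    `expected` is the decoded JSON from production, `actual` from the clone.
    Values are not compared, only their shape. Note that int vs. float is
    reported as a type difference, which may or may not matter for clients.
    """
    diffs: List[str] = []
    if type(expected) is not type(actual):
        diffs.append(
            f"{path}: type {type(expected).__name__} != {type(actual).__name__}"
        )
        return diffs
    if isinstance(expected, dict):
        for key in sorted(set(expected) | set(actual)):
            if key not in actual:
                diffs.append(f"{path}.{key}: missing in candidate")
            elif key not in expected:
                diffs.append(f"{path}.{key}: unexpected in candidate")
            else:
                diffs.extend(diff_shape(expected[key], actual[key], f"{path}.{key}"))
    elif isinstance(expected, list):
        if len(expected) != len(actual):
            diffs.append(f"{path}: length {len(expected)} != {len(actual)}")
        # Compare elements pairwise up to the shorter length.
        for i, (e, a) in enumerate(zip(expected, actual)):
            diffs.extend(diff_shape(e, a, f"{path}[{i}]"))
    return diffs


# Made-up payloads standing in for a production and a clone response.
prod = {"success": {"data": [{"id": 1, "name": "A"}]}}
clone = {"success": {"data": [{"id": "1"}]}}

for line in diff_shape(prod, clone):
    print(line)
```

In practice the two inputs would be the decoded responses for the same endpoint and query parameters on each server. An empty result only means the shapes agree, so a value-level comparison would still be needed to establish full parity.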
Code updates
Migration status reports
MariaDB to macrostrat_two (standalone PostgreSQL database)
Below are some stats that show our progress with cloning the MariaDB data into the PostgreSQL macrostrat_two database. These results (from compare_data_counts()) show the table, row count, and column count comparisons between any two databases. Stats are executed from utils.py in MariaDB Migration to PostgreSQL.
macrostrat_two to existing Macrostrat database, macrostrat schema in PostgreSQL
We conducted a data comparison between the cloned PostgreSQL macrostrat_two database and the existing PostgreSQL macrostrat database in the Macrostrat development environment. The results indicate discrepancies in tables, row counts, and column counts.
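To illustrate the kind of comparison described above, here is a minimal sketch of how per-table counts from two databases could be reduced to a list of variances. This is not the compare_data_counts() implementation in utils.py, whose signature is not shown in this issue. `summarize_variances`, `Variance`, and the table names are hypothetical, and the counts are assumed to have been collected already (for example via `SELECT count(*)` per table and a lookup in `information_schema.columns`).

```python
from typing import Dict, List, NamedTuple, Optional, Tuple

# table name -> (row count, column count)
Counts = Dict[str, Tuple[int, int]]


class Variance(NamedTuple):
    table: str
    kind: str  # "missing_in_target", "missing_in_source", "rows", or "columns"
    source: Optional[object]
    target: Optional[object]


def summarize_variances(source: Counts, target: Counts) -> List[Variance]:
    """List tables that are missing on one side or differ in row/column count."""
    out: List[Variance] = []
    for table in sorted(set(source) | set(target)):
        if table not in target:
            out.append(Variance(table, "missing_in_target", source[table], None))
            continue
        if table not in source:
            out.append(Variance(table, "missing_in_source", None, target[table]))
            continue
        (s_rows, s_cols), (t_rows, t_cols) = source[table], target[table]
        if s_rows != t_rows:
            out.append(Variance(table, "rows", s_rows, t_rows))
        if s_cols != t_cols:
            out.append(Variance(table, "columns", s_cols, t_cols))
    return out


# Made-up counts for illustration only.
source_counts = {"table_a": (100, 5), "table_b": (10, 3)}
target_counts = {"table_a": (120, 5), "table_c": (7, 2)}

for v in summarize_variances(source_counts, target_counts):
    print(v)
```

Matching counts do not prove the rows are identical, so this kind of report is best read as a quick first pass before any row-level checks.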
Next Steps
We'll need to decide how to resolve these discrepancies. Options include merging the macrostrat_two data into the macrostrat database (or vice versa) or keeping the data in separate databases as is. Let me know your thoughts on how to proceed.