Make Reward Model Preprocessing Modifications #3431

asdataminer · 2023-06-05T19:18:47Z

Code Pull Requests

Please provide the following:

a clear explanation of what your code does
if applicable, a reference to an issue
a reproducible test for your PR (code, config and data sample)

Documentation Pull Requests

Note that the documentation HTML files are in docs/ while the Markdown sources are in mkdocs/docs.

If you are proposing a modification to the documentation you should change only the Markdown files.

api.md is automatically generated from the docstrings in the code, so if you want to change something in that file, first modify ludwig/api.py docstring, then run mkdocs/code_docs_autogen.py, which will create mkdocs/docs/api.md .

github-actions · 2023-06-05T20:55:15Z

Unit Test Results

      6 files ±      0       6 suites ±0 42m 54s ⏱️ - 36m 44s
2 780 tests +2 747 2 733 ✔️ +2 704   9 💤 +  5   38 ❌ +  38
8 346 runs +8 247 8 199 ✔️ +8 112 33 💤 +21 114 ❌ +114

For more details on these failures, see this check.

Results for commit 440cbec. ± Comparison against base commit 9112470.

This pull request removes 33 and adds 2780 tests. Note that renamed tests count towards both.

tests.integration_tests.test_cli ‑ test_reproducible_cli_runs[horovod-experiment-1919-0]
tests.integration_tests.test_cli ‑ test_reproducible_cli_runs[horovod-experiment-1919-1]
tests.integration_tests.test_cli ‑ test_reproducible_cli_runs[horovod-experiment-31-0]
tests.integration_tests.test_cli ‑ test_reproducible_cli_runs[horovod-experiment-31-1]
tests.integration_tests.test_cli ‑ test_reproducible_cli_runs[horovod-train-1919-0]
tests.integration_tests.test_cli ‑ test_reproducible_cli_runs[horovod-train-1919-1]
tests.integration_tests.test_cli ‑ test_reproducible_cli_runs[horovod-train-31-0]
tests.integration_tests.test_cli ‑ test_reproducible_cli_runs[horovod-train-31-1]
tests.integration_tests.test_cli ‑ test_train_cli_horovod
tests.integration_tests.test_experiment ‑ test_experiment_model_resume_distributed[horovod]
…

tests.ludwig.augmentation.test_augmentation_pipeline ‑ test_image_augmentation[augmentation_pipeline_ops0]
tests.ludwig.augmentation.test_augmentation_pipeline ‑ test_image_augmentation[augmentation_pipeline_ops1]
tests.ludwig.augmentation.test_augmentation_pipeline ‑ test_image_augmentation[augmentation_pipeline_ops2]
tests.ludwig.augmentation.test_augmentation_pipeline ‑ test_invalid_augmentation_parameters[None]
tests.ludwig.augmentation.test_augmentation_pipeline ‑ test_invalid_augmentation_parameters[augmentation_pipeline_ops1]
tests.ludwig.augmentation.test_augmentation_pipeline ‑ test_invalid_augmentation_parameters[augmentation_pipeline_ops2]
tests.ludwig.augmentation.test_augmentation_pipeline ‑ test_invalid_augmentation_parameters[augmentation_pipeline_ops4]
tests.ludwig.augmentation.test_augmentation_pipeline ‑ test_invalid_augmentation_parameters[random_horizontal_flip]
tests.ludwig.augmentation.test_augmentation_pipeline ‑ test_load_model_with_augmentation_pipeline
tests.ludwig.augmentation.test_augmentation_pipeline ‑ test_local_model_training_with_augmentation_pipeline[preprocessing0-encoder0-False]
…

This pull request removes 4 skipped tests and adds 9 skipped tests. Note that renamed tests count towards both.

tests.integration_tests.test_horovod ‑ test_horovod_gpu_memory_limit
tests.regression_tests.benchmark.test_model_performance ‑ test_performance[ames_housing.ecd.yaml]
tests.regression_tests.benchmark.test_model_performance ‑ test_performance[mercedes_benz_greener.ecd.yaml]
tests.regression_tests.benchmark.test_model_performance ‑ test_performance[sarcos.ecd.yaml]

tests.ludwig.automl.test_base_config
tests.ludwig.automl.test_utils
tests.ludwig.backend.test_ray
tests.ludwig.benchmarking.test_profiler
tests.ludwig.data.test_ray_data
tests.ludwig.models.test_training_determinism ‑ test_training_determinism_ray_backend
tests.ludwig.utils.test_fs_utils ‑ test_get_fs_and_path_invalid_windows
tests.ludwig.utils.test_hyperopt_ray_utils ‑ test_grid_strategy[test_1]
tests.ludwig.utils.test_hyperopt_ray_utils ‑ test_grid_strategy[test_2]

♻️ This comment has been updated with latest results.

tgaddair · 2023-06-05T23:33:55Z

ludwig/data/preprocessing.py

@@ -1205,6 +1205,50 @@ def build_dataset(
            logger.debug(f"sample {sample_ratio} of data")
            dataset_df = dataset_df.sample(frac=sample_ratio, random_state=random_seed)

+        # If training a reward model, perform grouping and joining on dataset


Nice! This looks like the right set of transformations, but I think we'll likely want to do this at the end of preprocessing, using the processed text input feature, rather than here at the beginning. Specifically, I would consider doing this here:

https://github.com/ludwig-ai/ludwig/blob/master/ludwig/data/preprocessing.py#L1364

I would also suggest adding a test in test_preprocessing.py to verify it works end-to-end on a synthetic dataset.

Sounds good! I have made the edits and refactored the code a bit to be simpler.

asdataminer added 2 commits June 5, 2023 12:07

Make preprocessing modifications V1

2bc1d01

Add dataset validation

d6cc331

asdataminer force-pushed the rlhf_reward branch from ba290a6 to d6cc331 Compare June 5, 2023 19:31

Small edit

ea07940

tgaddair reviewed Jun 5, 2023

View reviewed changes

asdataminer added 2 commits June 7, 2023 15:44

Add tests

e993d7b

Small edits

98db29b

asdataminer force-pushed the rlhf_reward branch from e4b4b0f to 98db29b Compare June 8, 2023 00:02

asdataminer added 3 commits June 7, 2023 17:02

Another small edit

518feca

Modify processing strategy

8ac5101

Small edit

b61a174

asdataminer force-pushed the rlhf_reward branch from e4baa9e to b61a174 Compare June 12, 2023 03:07

asdataminer requested a review from geoffreyangus June 12, 2023 03:08

asdataminer added 2 commits June 11, 2023 20:28

Another small edit

e78152d

Small edit

440cbec

mhabedank closed this Oct 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make Reward Model Preprocessing Modifications #3431

Make Reward Model Preprocessing Modifications #3431

asdataminer commented Jun 5, 2023

github-actions bot commented Jun 5, 2023 •

edited

Loading

tgaddair Jun 5, 2023

asdataminer Jun 12, 2023

Make Reward Model Preprocessing Modifications #3431

Make Reward Model Preprocessing Modifications #3431

Conversation

asdataminer commented Jun 5, 2023

Code Pull Requests

Documentation Pull Requests

github-actions bot commented Jun 5, 2023 • edited Loading

Unit Test Results

tgaddair Jun 5, 2023

Choose a reason for hiding this comment

asdataminer Jun 12, 2023

Choose a reason for hiding this comment

github-actions bot commented Jun 5, 2023 •

edited

Loading