feat: add score document support in csv #696

bwanglzu · 2023-03-19T12:56:00Z

This PR allows user create a CSV file contains three columns, col1 and col2 are content, and col3 indicates the similarity between col1 and col2. Besides, I refactored the build_finetuning_dataset function.

This PR references an open issue
I have added a line about this change to CHANGELOG

LMMilliken

LGTM, I think the refactoring makes this much easier to read!
I think CSVHandler is a bit vague though, maybe CSVParser or CSVReader?

CHANGELOG.md

finetuner/data.py

guenthermi

Added some comments, if we really introduce the CSVContext class, we also need to update the documentation.

finetuner/data.py

guenthermi · 2023-03-20T13:09:17Z

Added some comments, if we really introduce the CSVContext class, we also need to update the documentation.

Ok probably, there is nothing in the documentation about building dataset from CSV, because we only apply it automatically if one provides a file path in the csv.

bwanglzu · 2023-03-20T14:24:36Z

documentation will be added in a seperate PR: #688 (comment)

CHANGELOG.md

Co-authored-by: George Mastrapas <[email protected]>

guenthermi

LGTM

guenthermi · 2023-03-21T08:04:58Z

finetuner/client/client.py

@@ -70,7 +70,7 @@ def list_experiments(self, page: int = 1, size: int = 50) -> Dict[str, Any]:
 ..note:: The maximum number for `size` per page is 100.
 """
 params = {'page': page, 'size': size}
- url = self._construct_url(self._base_url, API_VERSION, EXPERIMENTS)
+ url = self._construct_url(self._base_url, API_VERSION, EXPERIMENTS) + '/'


Can this be don in the construct_url function?

later we will investigate why this is happening

feat: add score document support in csv

e127ba2

github-actions bot added size/s area/core labels Mar 19, 2023

refactor: add csv handlers

05c5053

github-actions bot added size/m and removed size/s labels Mar 19, 2023

bwanglzu added 2 commits March 19, 2023 16:43

refactor: remove build finetuning dataset

6477723

test: add unit tests

358e78a

github-actions bot added the area/testing This issue/PR affects testing label Mar 19, 2023

bwanglzu added 3 commits March 19, 2023 18:17

test: fix csv reader add stringio as hints

1101463

feat: rename modality variable to col1 col2

5d6636c

test: add unit test

1595093

bwanglzu self-assigned this Mar 20, 2023

bwanglzu added 4 commits March 20, 2023 10:25

feat: use a trial input size for mlp

01762f9

feat: use a trial input size for mlp

6787dc0

refactor: fix task when model is mlp

c13d25d

chore: add changelog

5be341f

bwanglzu marked this pull request as ready for review March 20, 2023 10:16

bwanglzu requested review from LMMilliken, gmastrapas and guenthermi March 20, 2023 12:24

LMMilliken approved these changes Mar 20, 2023

View reviewed changes

gmastrapas reviewed Mar 20, 2023

View reviewed changes

guenthermi suggested changes Mar 20, 2023

View reviewed changes

finetuner/data.py Outdated Show resolved Hide resolved

finetuner/data.py Outdated Show resolved Hide resolved

finetuner/data.py Show resolved Hide resolved

bwanglzu added 3 commits March 20, 2023 14:09

feat: improve variable names and docstring

b2beaa7

feat: rename handler to parser

8e176ec

feat: add docstring to csv context

8045671

test: debug experiment endpoint

2b3d330

github-actions bot added size/l area/client and removed size/m labels Mar 20, 2023

test: debug experiment endpoint

c2a110f

gmastrapas approved these changes Mar 20, 2023

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

chore: update changelog

b40611e

Co-authored-by: George Mastrapas <[email protected]>

guenthermi approved these changes Mar 21, 2023

View reviewed changes

bwanglzu merged commit a24d95e into main Mar 21, 2023

bwanglzu deleted the feat-score-csv branch March 21, 2023 08:15

bwanglzu mentioned this pull request Mar 29, 2023

chore: release note 0.7.4 #702

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add score document support in csv #696

feat: add score document support in csv #696

bwanglzu commented Mar 19, 2023 •

edited

Loading

LMMilliken left a comment

guenthermi left a comment

guenthermi commented Mar 20, 2023

bwanglzu commented Mar 20, 2023

guenthermi left a comment

guenthermi Mar 21, 2023

bwanglzu Mar 21, 2023

feat: add score document support in csv #696

feat: add score document support in csv #696

Conversation

bwanglzu commented Mar 19, 2023 • edited Loading

LMMilliken left a comment

Choose a reason for hiding this comment

guenthermi left a comment

Choose a reason for hiding this comment

guenthermi commented Mar 20, 2023

bwanglzu commented Mar 20, 2023

guenthermi left a comment

Choose a reason for hiding this comment

guenthermi Mar 21, 2023

Choose a reason for hiding this comment

bwanglzu Mar 21, 2023

Choose a reason for hiding this comment

bwanglzu commented Mar 19, 2023 •

edited

Loading