
Data layer rework #72

Draft
trias702 wants to merge 5 commits into main from degert/data-rework

Conversation


trias702 (Collaborator) commented Jan 5, 2024

What does this PR do?

Rewrites the builder and dataset classes/functions to remove many of the superfluous holdover elements. It also unifies the RM and DPO JSONL formats and switches the RM model to GBS (global batch size) batching, similar to DPO.
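As a rough illustration of the unified format, the sketch below writes and reads a single preference record that both the RM and DPO dataset builders would consume. The field names (prompt, chosen_response, rejected_response) are assumptions for illustration only, not necessarily the exact keys this PR settles on.

```python
import json

# Hypothetical unified RM/DPO preference record; the real schema in this PR
# may use different keys -- this is an illustrative sketch only.
record = {
    "prompt": "Explain why the sky is blue.",
    "chosen_response": "Sunlight scatters off air molecules, and shorter (blue) wavelengths scatter the most.",
    "rejected_response": "Because the ocean reflects its color onto the sky.",
}

with open("preference_data.jsonl", "w") as f:
    f.write(json.dumps(record) + "\n")

# Both RM and DPO training would then read the same file with the same parser.
with open("preference_data.jsonl") as f:
    for line in f:
        example = json.loads(line)
        print(example["prompt"], "->", example["chosen_response"][:40])
```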

Changelog

  • Please update CHANGELOG.md under the next version with the high-level changes in this PR.

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 
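For example, here is a minimal sketch of GBS-style batching over the unified JSONL data described above. The function and file names are hypothetical rather than this PR's actual API; the sketch only illustrates grouping examples into global batches made of micro-batches, as DPO already does.

```python
import json
from typing import Iterator, List


def load_jsonl(path: str) -> List[dict]:
    """Read one JSON object per line; the same loader serves RM and DPO data."""
    with open(path) as f:
        return [json.loads(line) for line in f if line.strip()]


def global_batches(examples: List[dict],
                   micro_batch_size: int,
                   global_batch_size: int) -> Iterator[List[List[dict]]]:
    """Yield global batches, each split into micro-batches (GBS-style)."""
    assert global_batch_size % micro_batch_size == 0
    for start in range(0, len(examples), global_batch_size):
        chunk = examples[start:start + global_batch_size]
        if len(chunk) < global_batch_size:
            break  # drop the trailing partial global batch
        yield [chunk[i:i + micro_batch_size]
               for i in range(0, global_batch_size, micro_batch_size)]


# Example: iterate RM training data exactly the way DPO data is iterated.
for gb in global_batches(load_jsonl("preference_data.jsonl"),
                         micro_batch_size=2, global_batch_size=8):
    print(len(gb), "micro-batches of", len(gb[0]), "examples each")
```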

Before your PR is "Ready for review"

Pre checks:

Checklist when contributing a new algorithm

  • Does the trainer resume and restore all model states?
  • Does the trainer support all parallelism techniques (PP, TP, DP)?
  • Does the trainer support max_steps=-1 and validation?
  • Does the trainer only call APIs defined in alignable_interface.py?
  • Does the trainer have proper logging?

Additional Information

  • Related to # (issue)

Signed-off-by: Daniel Egert <degert@nvidia.com>