add streaming beam search for cache aware models to NeMo inference #15768
Open
lilithgrigoryan wants to merge 33 commits into main from lgrigoryan/streaming-beam-search-niva-cache-aware
33 commits
3a6f29e: merge (lilithgrigoryan)
03937b0: add n-chunk reseting working (lilithgrigoryan)
e889eea: saving config (lilithgrigoryan)
e51ee3c: add n-chunk reseting working (lilithgrigoryan)
3765acc: add eou resetting (lilithgrigoryan)
0e11a4f: clean up debug prints (lilithgrigoryan)
913000a: Merge branch 'main' of https://github.com/NVIDIA/NeMo into lgrigoryan… (lilithgrigoryan)
6dc1423: typecast fix (lilithgrigoryan)
0a69dee: clean up (lilithgrigoryan)
e051b12: isort and black + clean up (lilithgrigoryan)
4dce1e6: clean up (lilithgrigoryan)
893f656: clean up (lilithgrigoryan)
664a246: isort and black (lilithgrigoryan)
b342a16: add per-stream biasing (lilithgrigoryan)
b9a31a4: clean up (lilithgrigoryan)
0ed3d92: clean up (lilithgrigoryan)
56816df: refactor, separate state (lilithgrigoryan)
f6da7a5: isort and black (lilithgrigoryan)
22f5f7d: clean up (lilithgrigoryan)
a299ee4: restore docstring (lilithgrigoryan)
957084f: move malsd stream step to model wrapper (lilithgrigoryan)
49eb9fe: clean up (lilithgrigoryan)
7be3088: refactor per-stream biasing, add utils (lilithgrigoryan)
629de90: add malsd-only warning (lilithgrigoryan)
c59bc00: isort and black (lilithgrigoryan)
36e1702: Merge branch 'main' of https://github.com/NVIDIA/NeMo into lgrigoryan… (lilithgrigoryan)
3656938: restore releasing biaing models (lilithgrigoryan)
7840a22: minor clean up (lilithgrigoryan)
fadef4e: clean up (lilithgrigoryan)
b2b7116: isort and black (lilithgrigoryan)
205e85c: clean up (lilithgrigoryan)
211632e: minor changes (lilithgrigoryan)
58c7df9: Merge branch 'main' of https://github.com/NVIDIA/NeMo into lgrigoryan… (lilithgrigoryan)