Skip to content

Conversation

@hendrikvanantwerpen
Copy link
Contributor

@hendrikvanantwerpen hendrikvanantwerpen commented Oct 17, 2024

Eliminates look-ahead in the pre-tokenization regexes by supporting manually implemented trim functions in the pretokenizer.

The pretokenizer still accepts fancy regex, but since we don't use these features, it'll fall back to the regular regex crate.

Tasks

  • Run benchmark, update figures and text.

@hendrikvanantwerpen hendrikvanantwerpen self-assigned this Oct 17, 2024
@hendrikvanantwerpen
Copy link
Contributor Author

Closed in favor of #33.

@hendrikvanantwerpen hendrikvanantwerpen deleted the eliminate-look-ahead branch October 18, 2024 16:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants