Anthony MOI
1c4593cad4
Python - Remove warning on unused Token
2019-12-13 15:28:48 -05:00
Anthony MOI
e93cc62a71
Python - Handle kwargs for bert modules
2019-12-13 15:28:29 -05:00
Anthony MOI
3355be89cd
Python - Update examples and improve errors
2019-12-13 14:37:29 -05:00
Anthony MOI
7cf4b3a6cd
Python - Rewrite PyDecoder and PyPreTokenizer
2019-12-13 12:20:25 -05:00
Anthony MOI
2a0ad97809
Python - Update API to allow failure
2019-12-13 12:20:05 -05:00
Anthony MOI
1c7be358b7
Python - Better error conversions
2019-12-13 12:14:27 -05:00
Anthony MOI
7cb2fe2ea0
Bump version
2019-12-10 18:01:07 -05:00
Anthony MOI
b4b31d73cd
Expose vocabulary size
2019-12-10 16:20:31 -05:00
Anthony MOI
6c294c60b0
Python - Add Encoding repr + improve example
2019-12-10 15:18:07 -05:00
Anthony MOI
99773d9ce4
Python - Add encoding getters
2019-12-10 15:17:41 -05:00
Anthony MOI
8cedc5f1f6
Update Python bindings for Encoding
2019-12-10 12:38:36 -05:00
Anthony MOI
018f57f054
Python - Update example
2019-12-09 12:51:05 -05:00
Anthony MOI
849272d44f
Python - add missing modules exports
2019-12-09 12:50:53 -05:00
Anthony MOI
3979096c52
Python - add BasicPreTokenizer
2019-12-09 12:50:09 -05:00
Anthony MOI
d60d24a378
Python - Add WordPiece model
2019-12-09 12:49:44 -05:00
Anthony MOI
5eba30835d
Python - Add WordPiece decoder
2019-12-09 12:49:17 -05:00
Anthony MOI
c46ec97855
Update README
2019-12-03 17:26:20 -05:00
Anthony MOI
75232c0f06
Fix setup.py
2019-12-03 16:20:20 -05:00
Anthony MOI
499f5507df
Bump versions for 0.0.3 release
2019-12-03 16:11:45 -05:00
Anthony MOI
ec2ed483a3
Improve python readme with training example
2019-12-03 16:11:03 -05:00
Anthony MOI
eaafb22511
Add bindings for Trainer in Python
2019-12-03 15:54:15 -05:00
Anthony MOI
5011523e99
Update python readme
2019-12-03 10:26:19 -05:00
MOI Anthony
5f31ac3f75
Python release CI ( #2 )
2019-12-02 19:04:25 -05:00
Anthony MOI
5db08ac15d
Update wheel building
2019-11-29 22:36:17 -05:00
Anthony MOI
27ac65c466
Remove onig dependency
2019-11-29 21:35:16 -05:00
Anthony MOI
428890d6e0
Basic python setuptools
2019-11-27 12:21:37 -05:00
Anthony MOI
e49abab747
Python - Add Decoder/PreTokenizer standalone capabilities
2019-11-26 17:52:19 -05:00
Anthony MOI
d565bbf309
Container - Add ability to execute
2019-11-26 17:51:26 -05:00
Anthony MOI
f4369b312d
Python - Add ability to create custom Decoder
2019-11-25 19:14:07 -05:00
Anthony MOI
512e85dfda
Update python README
2019-11-24 00:55:13 -05:00
Anthony MOI
6437c40235
Python - PoC Custom PreTokenizer
2019-11-24 00:52:13 -05:00
Anthony MOI
b081e6ca04
Python - Also expose default classes
2019-11-24 00:35:05 -05:00
Anthony MOI
bd1aa80d8a
Python - Custom PreTokenizer backbone
2019-11-23 23:59:33 -05:00
Anthony MOI
891fc12de2
Python - Update example with new format
2019-11-22 21:09:17 -05:00
Anthony MOI
8fbe3c2662
Python - Add decoders
2019-11-22 21:08:57 -05:00
Anthony MOI
e44f52024c
Python - Set a PreTokenizer in a model
2019-11-22 21:01:52 -05:00
Anthony MOI
9b71c8f8de
Python - BPE construction
2019-11-22 20:57:54 -05:00
Anthony MOI
f6a9b57b5b
Python - Add pre_tokenizers module
2019-11-22 20:56:50 -05:00
Anthony MOI
39a6d04c53
Improve Python bindings
...
This is an attempt at actually exposing the same structure that we use in the Rust lib. This will allow Python to instantiate Model/PreTokenizer/... with their own arguments, combining everything without relying on parsed kwargs.
2019-11-22 17:57:36 -05:00
Anthony MOI
663644e041
Fix ByteLevel Decoder
...
The join was done after replacing bytes and building subwords, which was preventing bytes across these subwords to be merged correctly. We need to join first.
2019-11-21 16:50:25 -05:00
Anthony MOI
070fd08583
Update python example
2019-11-21 11:57:57 -05:00
Anthony MOI
c28a83cdc4
Update python bindings
2019-11-21 11:55:07 -05:00
Anthony MOI
3ec26b332c
Add Tokenizer token_to_id/id_to_token
2019-11-20 17:28:28 -05:00
Anthony MOI
8b3d7d1aa0
Add vocab/merge arguments to example.py
2019-11-20 16:47:02 -05:00
Anthony MOI
98323d1f21
Update readme and fix example
2019-11-19 19:38:57 -05:00
Anthony MOI
351d526e1e
Basic python bindings
2019-11-19 19:31:37 -05:00
Anthony MOI
05cbb32eca
Python readme
2019-11-01 19:42:36 -04:00
Anthony MOI
fd7ec39367
Update python bindings
2019-11-01 18:56:55 -04:00
Anthony MOI
8448d50e6f
Quick improvement over python bindings
2019-11-01 16:08:10 -04:00
Anthony MOI
5d37cfde7f
Python bindings backbone
2019-11-01 15:02:19 -04:00