Commit Graph

289 Commits

Author SHA1 Message Date
Anthony MOI
a9f4c5950a Python - Test Decoders 2020-04-01 17:25:55 -04:00
Anthony MOI
0de9885da8 Python - Test PreTokenizers 2020-04-01 17:25:55 -04:00
Anthony MOI
d6692d4072 Python - Test Normalizers 2020-04-01 17:25:55 -04:00
Anthony MOI
3264ffe235 Python - Improve tests on Tokenizer 2020-04-01 17:25:55 -04:00
Anthony MOI
5ebe687753 Python - Add first implementations tests 2020-04-01 17:25:55 -04:00
Anthony MOI
023566fbbb Python - Add some tests utils 2020-04-01 17:25:55 -04:00
Anthony MOI
477037fd6b Python - Improve AddedToken repr 2020-04-01 17:25:55 -04:00
Anthony MOI
b055b77b54 Python - Add first tests: Tokenizer 2020-04-01 17:25:55 -04:00
Anthony MOI
f15c088cf3 Python - Hotfix typing import 2020-04-01 11:35:07 -04:00
Anthony MOI
2a84ef12cf Python - Add missing get_vocab from BaseTokenizer 2020-04-01 11:32:54 -04:00
Anthony MOI
93a83127ae Bump version for Python release 2020-03-31 14:25:47 -04:00
Morgan Funtowicz
afe9cfe96e Strip should inherits from Normalizer on Python binding.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
2020-03-31 20:20:09 +02:00
Anthony MOI
279db8537e Update CHANGELOGs 2020-03-27 18:46:20 -04:00
Anthony MOI
5038a7f74e Update CHANGELOGs 2020-03-27 17:49:02 -04:00
Anthony MOI
a2a6d80017 Python - expost get_vocab on Tokenizer 2020-03-27 11:53:18 -04:00
Anthony MOI
e8aec7a624 Bump version for Python release 2020-03-27 09:17:35 -04:00
Morgan Funtowicz
bc46064a61 Added missing imports for AddedToken 2020-03-27 12:32:23 +01:00
Anthony MOI
b132be34af Bump version for Python release 2020-03-26 17:26:14 -04:00
Anthony MOI
6c232886b8 Improve & update CHANGELOGs 2020-03-26 17:08:42 -04:00
Anthony MOI
4341c79d85 Python - last fixes on Encoding bindings/typings 2020-03-26 15:42:45 -04:00
Anthony MOI
14e3ab3787 Python - fix style 2020-03-26 15:42:45 -04:00
Morgan Funtowicz
39958a2f0f TokenizedSequence / TokenizedSequenceWithOffsets needs to be declared in .py files not only .pyi 2020-03-26 15:42:45 -04:00
Morgan Funtowicz
68405a6fae Forward type_id in encode_tokenized/encode_tokenized_batch python binding. 2020-03-26 15:42:45 -04:00
Anthony MOI
9bd9e0b3c1 Expose post_process on the Tokenizer 2020-03-26 15:42:45 -04:00
Anthony MOI
9ce895550b Add some new merging capability on Encoding 2020-03-26 15:42:44 -04:00
Anthony MOI
eec74ca3e6 Python - Add Model.encode_batch and improve typings 2020-03-26 15:42:44 -04:00
Anthony MOI
1150751ab6 Python - Update mappings API 2020-03-26 15:42:44 -04:00
Anthony MOI
a397a1da63 Python - Expose encode method on Model 2020-03-26 15:42:44 -04:00
Anthony MOI
8de6ef5a37 Python - Bind new Encoding's mappings 2020-03-26 15:42:44 -04:00
Anthony MOI
e8925a33da Python - remove add_special_tokens from BertWordPieceTokenizer init 2020-03-26 14:19:37 -04:00
Anthony MOI
f8d54edcdd Python - Fix cases where str expected instead of AddedToken 2020-03-25 19:22:53 -04:00
Anthony MOI
c65d53892d Python - Add bindings for new AddedToken options 2020-03-24 20:58:45 -04:00
Anthony MOI
d953d58cee Rust - Fix offsets when there are added tokens 2020-03-19 12:53:03 -04:00
Anthony MOI
d53de0e2da Python - Expose normalize on BaseTokenizer 2020-03-18 16:44:31 -04:00
Anthony MOI
ae0d330907 Update CHANGELOGs 2020-03-18 16:42:27 -04:00
Anthony MOI
60a4fb35f4 Python - Update bindings 2020-03-16 10:36:42 -04:00
Morgan Funtowicz
505bfbba82 Fix invalid error messages. 2020-03-12 15:38:29 +01:00
Morgan Funtowicz
5ed1f26c71 Throw a more meaningful error when provided python input is None. 2020-03-12 10:59:05 +01:00
Anthony MOI
257360acec Python - encode & encode batch with add_special_tokens 2020-03-10 16:21:10 -04:00
Anthony MOI
a9be177185 Update CHANGELOGs 2020-03-10 13:12:34 -04:00
Anthony MOI
28f022058c Keep default values as true 2020-03-10 12:58:53 -04:00
Anthony MOI
45f3eaaf72 Update bindings and typings 2020-03-10 12:28:24 -04:00
Anthony MOI
efbbfea558 Update ByteLevel PostProcessor 2020-03-10 12:05:04 -04:00
Anthony MOI
7e9003ccb7 Python - Update bindings 2020-03-09 18:37:03 -04:00
Anthony MOI
86d2e90ad2 Update CHANGELOGs 2020-03-06 17:44:44 -05:00
Anthony MOI
d778ed5e0a Python - Update README and implementation 2020-03-06 17:44:44 -05:00
Anthony MOI
52180a9179 Python - Add ByteLevel PostProcessor 2020-03-06 17:44:44 -05:00
Anthony MOI
b60eef5245 Python - Make style 2020-03-06 17:44:44 -05:00
Anthony MOI
d8e7a830b2 Update CHANGELOGs 2020-03-06 17:44:34 -05:00
Anthony MOI
b2e5f54b6f Python - Fix ByteLevelBPETokenizer implementation 2020-03-06 17:44:03 -05:00