DFSG NEW Queue

DFSG, Licensing & New Packages Team

Review: tokenizers 0.20.3+dfsg-1

Review Information

Packagetokenizers — 0.20.3+dfsg-1
Reviewermechtilde
Allocated5 days ago
Started5 days ago
Statusaccepted
Completed5 days ago

Final Comment

New Package Report

Notes

5 days ago ● public

Licenserecon

Command: lrc
Exit code: 3

de: Versions: licenserecon '11.0'  licensecheck '3.3.9-1'

Quellbaum analysieren  ....
Lesen d/copyright  ....
Wird ausgeführt licensecheck ....

d/copyright      | licensecheck

Apache-2.0       | Expat             bindings/node/LICENSE
Apache-2.0       | Expat             bindings/node/npm/android-arm64/package.json
Apache-2.0       | Expat             bindings/node/npm/android-arm-eabi/package.json
Apache-2.0       | Expat             bindings/node/npm/darwin-arm64/package.json
Apache-2.0       | Expat             bindings/node/npm/darwin-x64/package.json
Apache-2.0       | Expat             bindings/node/npm/freebsd-x64/package.json
Apache-2.0       | Expat             bindings/node/npm/linux-arm64-gnu/package.json
Apache-2.0       | Expat             bindings/node/npm/linux-arm64-musl/package.json
Apache-2.0       | Expat             bindings/node/npm/linux-arm-gnueabihf/package.json
Apache-2.0       | Expat             bindings/node/npm/linux-x64-gnu/package.json
Apache-2.0       | Expat             bindings/node/npm/linux-x64-musl/package.json
Apache-2.0       | Expat             bindings/node/npm/win32-arm64-msvc/package.json
Apache-2.0       | Expat             bindings/node/npm/win32-ia32-msvc/package.json
Apache-2.0       | Expat             bindings/node/npm/win32-x64-msvc/package.json
Apache-2.0       | Expat and/or ISC  bindings/node/.yarn/releases/yarn-3.5.1.cjs
Expat or Apache-2.0| Apache-2.0        tokenizers/examples/unstable_wasm/www/LICENSE-APACHE
Expat or Apache-2.0| Expat             tokenizers/examples/unstable_wasm/www/LICENSE-MIT
Expat or Apache-2.0| Expat             tokenizers/examples/unstable_wasm/www/package.json
Expat or Apache-2.0| Expat             tokenizers/examples/unstable_wasm/www/package-lock.json

5 days ago ● public

licensecheck

Command: licensecheck -c '.*' -r --deb-machine -l 0 .
Exit code: 0

Format: https://www.debian.org/doc/packaging-manuals/copyright-format/1.0/
Upstream-Name: FIXME
Upstream-Contact: FIXME
Source: FIXME
Disclaimer: Autogenerated by licensecheck

Files: ./.github/conda/bld.bat
 ./.github/conda/build.sh
 ./.github/stale.yml
 ./.github/workflows/build_documentation.yml
 ./.github/workflows/build_pr_documentation.yml
 ./.github/workflows/delete_doc_comment.yml
 ./.github/workflows/delete_doc_comment_trigger.yml
 ./.github/workflows/docs-check.yml
 ./.github/workflows/node-release.yml
 ./.github/workflows/node.yml
 ./.github/workflows/python-release-conda.yml
 ./.github/workflows/python-release.yml
 ./.github/workflows/python.yml
 ./.github/workflows/rust-release.yml
 ./.github/workflows/rust.yml
 ./.github/workflows/stale.yml
 ./.github/workflows/trufflehog.yml
 ./.github/workflows/upload_pr_documentation.yml
 ./README.md
 ./RELEASE.md
 ./bindings/node/.cargo/config.toml
 ./bindings/node/.editorconfig
 ./bindings/node/.eslintrc.yml
 ./bindings/node/.gitattributes
 ./bindings/node/.prettierignore
 ./bindings/node/.taplo.toml
 ./bindings/node/.yarnrc.yml
 ./bindings/node/Cargo.toml
 ./bindings/node/Makefile
 ./bindings/node/README.md
 ./bindings/node/build.rs
 ./bindings/node/examples/documentation/pipeline.test.ts
 ./bindings/node/examples/documentation/quicktour.test.ts
 ./bindings/node/index.d.ts
 ./bindings/node/index.js
 ./bindings/node/jest.config.js
 ./bindings/node/lib/bindings/__mocks__/vocab.json
 ./bindings/node/lib/bindings/__mocks__/vocab.txt
 ./bindings/node/lib/bindings/decoders.test.ts
 ./bindings/node/lib/bindings/encoding.test.ts
 ./bindings/node/lib/bindings/models.test.ts
 ./bindings/node/lib/bindings/normalizers.test.ts
 ./bindings/node/lib/bindings/post-processors.test.ts
 ./bindings/node/lib/bindings/pre-tokenizers.test.ts
 ./bindings/node/lib/bindings/tokenizer.test.ts
 ./bindings/node/lib/bindings/utils.test.ts
 ./bindings/node/npm/android-arm-eabi/README.md
 ./bindings/node/npm/android-arm64/README.md
 ./bindings/node/npm/darwin-arm64/README.md
 ./bindings/node/npm/darwin-x64/README.md
 ./bindings/node/npm/freebsd-x64/README.md
 ./bindings/node/npm/linux-arm-gnueabihf/README.md
 ./bindings/node/npm/linux-arm64-gnu/README.md
 ./bindings/node/npm/linux-arm64-musl/README.md
 ./bindings/node/npm/linux-x64-gnu/README.md
 ./bindings/node/npm/linux-x64-musl/README.md
 ./bindings/node/npm/win32-arm64-msvc/README.md
 ./bindings/node/npm/win32-ia32-msvc/README.md
 ./bindings/node/npm/win32-x64-msvc/README.md
 ./bindings/node/rustfmt.toml
 ./bindings/node/src/arc_rwlock_serde.rs
 ./bindings/node/src/decoders.rs
 ./bindings/node/src/encoding.rs
 ./bindings/node/src/lib.rs
 ./bindings/node/src/models.rs
 ./bindings/node/src/normalizers.rs
 ./bindings/node/src/pre_tokenizers.rs
 ./bindings/node/src/processors.rs
 ./bindings/node/src/tasks/mod.rs
 ./bindings/node/src/tasks/models.rs
 ./bindings/node/src/tasks/tokenizer.rs
 ./bindings/node/src/tokenizer.rs
 ./bindings/node/src/trainers.rs
 ./bindings/node/src/utils.rs
 ./bindings/node/tsconfig.json
 ./bindings/node/types.ts
 ./bindings/node/yarn.lock
 ./bindings/python/.cargo/config.toml
 ./bindings/python/CHANGELOG.md
 ./bindings/python/Cargo.lock
 ./bindings/python/Cargo.toml
 ./bindings/python/MANIFEST.in
 ./bindings/python/Makefile
 ./bindings/python/README.md
 ./bindings/python/benches/test_tiktoken.py
 ./bindings/python/conftest.py
 ./bindings/python/examples/custom_components.py
 ./bindings/python/examples/example.py
 ./bindings/python/examples/train_bert_wordpiece.py
 ./bindings/python/examples/train_bytelevel_bpe.py
 ./bindings/python/examples/train_with_datasets.py
 ./bindings/python/examples/using_the_visualizer.ipynb
 ./bindings/python/py_src/tokenizers/__init__.py
 ./bindings/python/py_src/tokenizers/__init__.pyi
 ./bindings/python/py_src/tokenizers/decoders/__init__.py
 ./bindings/python/py_src/tokenizers/decoders/__init__.pyi
 ./bindings/python/py_src/tokenizers/implementations/__init__.py
 ./bindings/python/py_src/tokenizers/implementations/base_tokenizer.py
 ./bindings/python/py_src/tokenizers/implementations/bert_wordpiece.py
 ./bindings/python/py_src/tokenizers/implementations/byte_level_bpe.py
 ./bindings/python/py_src/tokenizers/implementations/char_level_bpe.py
 ./bindings/python/py_src/tokenizers/implementations/sentencepiece_bpe.py
 ./bindings/python/py_src/tokenizers/implementations/sentencepiece_unigram.py
 ./bindings/python/py_src/tokenizers/models/__init__.py
 ./bindings/python/py_src/tokenizers/models/__init__.pyi
 ./bindings/python/py_src/tokenizers/normalizers/__init__.py
 ./bindings/python/py_src/tokenizers/normalizers/__init__.pyi
 ./bindings/python/py_src/tokenizers/pre_tokenizers/__init__.py
 ./bindings/python/py_src/tokenizers/pre_tokenizers/__init__.pyi
 ./bindings/python/py_src/tokenizers/processors/__init__.py
 ./bindings/python/py_src/tokenizers/processors/__init__.pyi
 ./bindings/python/py_src/tokenizers/tools/__init__.py
 ./bindings/python/py_src/tokenizers/tools/visualizer-styles.css
 ./bindings/python/py_src/tokenizers/tools/visualizer.py
 ./bindings/python/py_src/tokenizers/trainers/__init__.py
 ./bindings/python/py_src/tokenizers/trainers/__init__.pyi
 ./bindings/python/rust-toolchain
 ./bindings/python/scripts/convert.py
 ./bindings/python/scripts/sentencepiece_extractor.py
 ./bindings/python/scripts/spm_parity_check.py
 ./bindings/python/setup.cfg
 ./bindings/python/src/decoders.rs
 ./bindings/python/src/encoding.rs
 ./bindings/python/src/error.rs
 ./bindings/python/src/lib.rs
 ./bindings/python/src/models.rs
 ./bindings/python/src/normalizers.rs
 ./bindings/python/src/pre_tokenizers.rs
 ./bindings/python/src/processors.rs
 ./bindings/python/src/token.rs
 ./bindings/python/src/tokenizer.rs
 ./bindings/python/src/trainers.rs
 ./bindings/python/src/utils/iterators.rs
 ./bindings/python/src/utils/mod.rs
 ./bindings/python/src/utils/normalization.rs
 ./bindings/python/src/utils/pretokenization.rs
 ./bindings/python/src/utils/regex.rs
 ./bindings/python/src/utils/serde_pyo3.rs
 ./bindings/python/stub.py
 ./bindings/python/test.txt
 ./bindings/python/tests/bindings/test_decoders.py
 ./bindings/python/tests/bindings/test_encoding.py
 ./bindings/python/tests/bindings/test_models.py
 ./bindings/python/tests/bindings/test_normalizers.py
 ./bindings/python/tests/bindings/test_pre_tokenizers.py
 ./bindings/python/tests/bindings/test_processors.py
 ./bindings/python/tests/bindings/test_tokenizer.py
 ./bindings/python/tests/bindings/test_trainers.py
 ./bindings/python/tests/documentation/test_pipeline.py
 ./bindings/python/tests/documentation/test_quicktour.py
 ./bindings/python/tests/documentation/test_tutorial_train_from_iterators.py
 ./bindings/python/tests/implementations/test_base_tokenizer.py
 ./bindings/python/tests/implementations/test_bert_wordpiece.py
 ./bindings/python/tests/implementations/test_byte_level_bpe.py
 ./bindings/python/tests/implementations/test_char_bpe.py
 ./bindings/python/tests/implementations/test_sentencepiece.py
 ./bindings/python/tests/test_serialization.py
 ./bindings/python/tests/utils.py
 ./debian/cargo-config.toml
 ./debian/changelog
 ./debian/control
 ./debian/patches/downgrade_onig.patch
 ./debian/patches/series
 ./debian/patches/update_itertools_under_tokenizer_dir.patch
 ./debian/patches/upgrade_itertools.patch
 ./debian/patches/upgrade_ndarray_version.patch
 ./debian/patches/upgrade_rayon_cond.patch
 ./debian/patches/upgrade_spm_precompiled.patch
 ./debian/patches/upgrade_thiserror.patch
 ./debian/rules
 ./debian/salsa-ci.yml
 ./debian/source/format
 ./debian/tests/control
 ./debian/watch
 ./tokenizers/CHANGELOG.md
 ./tokenizers/Cargo.toml
 ./tokenizers/Makefile
 ./tokenizers/README.md
 ./tokenizers/README.tpl
 ./tokenizers/benches/bert_benchmark.rs
 ./tokenizers/benches/bpe_benchmark.rs
 ./tokenizers/benches/common/mod.rs
 ./tokenizers/benches/layout_benchmark.rs
 ./tokenizers/benches/llama3.rs
 ./tokenizers/benches/unigram_benchmark.rs
 ./tokenizers/examples/encode_batch.rs
 ./tokenizers/examples/serialization.rs
 ./tokenizers/examples/unstable_wasm/Cargo.toml
 ./tokenizers/examples/unstable_wasm/README.md
 ./tokenizers/examples/unstable_wasm/src/lib.rs
 ./tokenizers/examples/unstable_wasm/src/utils.rs
 ./tokenizers/examples/unstable_wasm/tests/web.rs
 ./tokenizers/examples/unstable_wasm/www/.bin/create-wasm-app.js
 ./tokenizers/examples/unstable_wasm/www/.travis.yml
 ./tokenizers/examples/unstable_wasm/www/README.md
 ./tokenizers/examples/unstable_wasm/www/bootstrap.js
 ./tokenizers/examples/unstable_wasm/www/index.html
 ./tokenizers/examples/unstable_wasm/www/index.js
 ./tokenizers/examples/unstable_wasm/www/webpack.config.js
 ./tokenizers/rust-toolchain
 ./tokenizers/src/decoders/bpe.rs
 ./tokenizers/src/decoders/byte_fallback.rs
 ./tokenizers/src/decoders/ctc.rs
 ./tokenizers/src/decoders/fuse.rs
 ./tokenizers/src/decoders/mod.rs
 ./tokenizers/src/decoders/sequence.rs
 ./tokenizers/src/decoders/strip.rs
 ./tokenizers/src/decoders/wordpiece.rs
 ./tokenizers/src/lib.rs
 ./tokenizers/src/models/bpe/mod.rs
 ./tokenizers/src/models/bpe/model.rs
 ./tokenizers/src/models/bpe/serialization.rs
 ./tokenizers/src/models/bpe/trainer.rs
 ./tokenizers/src/models/bpe/word.rs
 ./tokenizers/src/models/mod.rs
 ./tokenizers/src/models/unigram/lattice.rs
 ./tokenizers/src/models/unigram/mod.rs
 ./tokenizers/src/models/unigram/model.rs
 ./tokenizers/src/models/unigram/serialization.rs
 ./tokenizers/src/models/unigram/trainer.rs
 ./tokenizers/src/models/unigram/trie.rs
 ./tokenizers/src/models/wordlevel/mod.rs
 ./tokenizers/src/models/wordlevel/serialization.rs
 ./tokenizers/src/models/wordlevel/trainer.rs
 ./tokenizers/src/models/wordpiece/mod.rs
 ./tokenizers/src/models/wordpiece/serialization.rs
 ./tokenizers/src/models/wordpiece/trainer.rs
 ./tokenizers/src/normalizers/bert.rs
 ./tokenizers/src/normalizers/byte_level.rs
 ./tokenizers/src/normalizers/mod.rs
 ./tokenizers/src/normalizers/precompiled.rs
 ./tokenizers/src/normalizers/prepend.rs
 ./tokenizers/src/normalizers/replace.rs
 ./tokenizers/src/normalizers/strip.rs
 ./tokenizers/src/normalizers/unicode.rs
 ./tokenizers/src/normalizers/utils.rs
 ./tokenizers/src/pre_tokenizers/bert.rs
 ./tokenizers/src/pre_tokenizers/byte_level.rs
 ./tokenizers/src/pre_tokenizers/delimiter.rs
 ./tokenizers/src/pre_tokenizers/digits.rs
 ./tokenizers/src/pre_tokenizers/metaspace.rs
 ./tokenizers/src/pre_tokenizers/mod.rs
 ./tokenizers/src/pre_tokenizers/punctuation.rs
 ./tokenizers/src/pre_tokenizers/sequence.rs
 ./tokenizers/src/pre_tokenizers/split.rs
 ./tokenizers/src/pre_tokenizers/unicode_scripts/mod.rs
 ./tokenizers/src/pre_tokenizers/unicode_scripts/pre_tokenizer.rs
 ./tokenizers/src/pre_tokenizers/unicode_scripts/scripts.rs
 ./tokenizers/src/pre_tokenizers/whitespace.rs
 ./tokenizers/src/processors/bert.rs
 ./tokenizers/src/processors/mod.rs
 ./tokenizers/src/processors/roberta.rs
 ./tokenizers/src/processors/sequence.rs
 ./tokenizers/src/processors/template.rs
 ./tokenizers/src/tokenizer/added_vocabulary.rs
 ./tokenizers/src/tokenizer/encoding.rs
 ./tokenizers/src/tokenizer/mod.rs
 ./tokenizers/src/tokenizer/normalizer.rs
 ./tokenizers/src/tokenizer/pattern.rs
 ./tokenizers/src/tokenizer/pre_tokenizer.rs
 ./tokenizers/src/tokenizer/serialization.rs
 ./tokenizers/src/utils/cache.rs
 ./tokenizers/src/utils/fancy.rs
 ./tokenizers/src/utils/from_pretrained.rs
 ./tokenizers/src/utils/iter.rs
 ./tokenizers/src/utils/mod.rs
 ./tokenizers/src/utils/onig.rs
 ./tokenizers/src/utils/padding.rs
 ./tokenizers/src/utils/parallelism.rs
 ./tokenizers/src/utils/progress.rs
 ./tokenizers/src/utils/truncation.rs
 ./tokenizers/tests/added_tokens.rs
 ./tokenizers/tests/common/mod.rs
 ./tokenizers/tests/documentation.rs
 ./tokenizers/tests/from_pretrained.rs
 ./tokenizers/tests/offsets.rs
 ./tokenizers/tests/serialization.rs
 ./tokenizers/tests/training.rs
 ./tokenizers/tests/unigram.rs
Copyright: NONE
License: UNKNOWN
 FIXME

Files: ./bindings/node/npm/android-arm-eabi/package.json
 ./bindings/node/npm/android-arm64/package.json
 ./bindings/node/npm/darwin-arm64/package.json
 ./bindings/node/npm/darwin-x64/package.json
 ./bindings/node/npm/freebsd-x64/package.json
 ./bindings/node/npm/linux-arm-gnueabihf/package.json
 ./bindings/node/npm/linux-arm64-gnu/package.json
 ./bindings/node/npm/linux-arm64-musl/package.json
 ./bindings/node/npm/linux-x64-gnu/package.json
 ./bindings/node/npm/linux-x64-musl/package.json
 ./bindings/node/npm/win32-arm64-msvc/package.json
 ./bindings/node/npm/win32-ia32-msvc/package.json
 ./bindings/node/npm/win32-x64-msvc/package.json
 ./tokenizers/examples/unstable_wasm/www/LICENSE-MIT
 ./tokenizers/examples/unstable_wasm/www/package-lock.json
 ./tokenizers/examples/unstable_wasm/www/package.json
Copyright: NONE
License: Expat
 FIXME

Files: ./.github/conda/meta.yaml
 ./CITATION.cff
 ./LICENSE
 ./bindings/node/package.json
 ./tokenizers/examples/unstable_wasm/www/LICENSE-APACHE
Copyright: NONE
License: Apache-2.0
 FIXME

Files: ./bindings/python/pyproject.toml
Copyright: NONE
License: Apache
 FIXME

Files: ./debian/copyright
Copyright: 2019-2025, Anthony MOI <m.anthony.moi@gmail.com>
  2019-2025, The HuggingFace Team
  2025, Kohei Sendai <kouhei.sendai@gmail.com>
  Ashley Williams <ashley666ashley@gmail.com>
License: Apache-2.0 and/or Expat
 FIXME

Files: ./bindings/node/.yarn/releases/yarn-3.5.1.cjs
Copyright: 2014
  2014, Blake Embrey (hello@blakeembrey.com)
  2014-2016, Jon Schlinkert.
  2014-2017, Jon Schlinkert.
  2015
  2015, Rebecca Turner
  2015-2018, Jon Schlinkert.
  Joyent, Inc. and other Node contributors.
  Node.js contributors.
License: BSD-2-clause and/or Expat and/or ISC
 FIXME

Files: ./bindings/node/LICENSE
Copyright: 2020, N-API for Rust
License: Expat
 FIXME

Back to Dashboard | View all reviews for this package