DFSG NEW Queue

DFSG, Licensing & New Packages Team

simdutf 8.2.0-1

Package Information

Description: Fast Unicode validation and transcoding - development files

Most modern software relies on the Unicode standard. In memory, Unicode strings are represented using either UTF-8 or UTF-16. The UTF-8 format is the de facto standard on the web (JSON, HTML, etc.) and it has been adopted as the default in many popular programming languages (Go, Zig, Rust, Swift, etc.). The UTF-16 format is standard in Java, C# and in many Windows technologies.

Not all sequences of bytes are valid Unicode strings. It is unsafe to use Unicode strings in UTF-8 and UTF-16LE without first validating them. Furthermore, we often need to convert strings from one encoding to another, by a process called transcoding. For security purposes, such transcoding should be validating: it should refuse to transcode incorrect strings.
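As a rough illustration of what "validating transcoding" means, here is a minimal Python sketch. It uses only Python's standard codecs, not simdutf or SIMD instructions, and the function name `transcode_utf8_to_utf16le` is hypothetical, chosen for this example. The point is only the contract: invalid input is refused rather than passed through.

```python
from typing import Optional


def transcode_utf8_to_utf16le(data: bytes) -> Optional[bytes]:
    """Return `data` re-encoded as UTF-16LE, or None if it is not valid UTF-8.

    Hypothetical helper for illustration; this is not the simdutf API.
    """
    try:
        # Strict decoding rejects malformed sequences, overlong forms,
        # and encoded surrogates instead of silently replacing them.
        text = data.decode("utf-8", errors="strict")
    except UnicodeDecodeError:
        return None
    return text.encode("utf-16-le")


if __name__ == "__main__":
    print(transcode_utf8_to_utf16le(b"caf\xc3\xa9"))  # valid input
    print(transcode_utf8_to_utf16le(b"\xc3\x28"))     # invalid continuation byte
```

A caller would check for `None` before using the result, so an incorrect string never reaches later processing stages.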

This library provides fast Unicode functions such as:

* ASCII, UTF-8, UTF-16LE/BE and UTF-32 validation, with and without error identification,
* Latin1 to UTF-8 transcoding,
* Latin1 to UTF-16LE/BE transcoding,
* Latin1 to UTF-32 transcoding,
* UTF-8 to Latin1 transcoding, with or without validation, with and without error identification,
* UTF-8 to UTF-16LE/BE transcoding, with or without validation, with and without error identification,
* UTF-8 to UTF-32 transcoding, with or without validation, with and without error identification,
* UTF-16LE/BE to Latin1 transcoding, with or without validation, with and without error identification,
* UTF-16LE/BE to UTF-8 transcoding, with or without validation, with and without error identification,
* UTF-32 to Latin1 transcoding, with or without validation, with and without error identification,
* UTF-32 to UTF-8 transcoding, with or without validation, with and without error identification,
* UTF-32 to UTF-16LE/BE transcoding, with or without validation, with and without error identification,
* UTF-16LE/BE to UTF-32 transcoding, with or without validation, with and without error identification,
* From a UTF-8 string, compute the size of the Latin1 equivalent string,
* From a UTF-8 string, compute the size of the UTF-16 equivalent string,
* From a UTF-8 string, compute the size of the UTF-32 equivalent string (equivalent to UTF-8 character counting),
* From a UTF-16LE/BE string, compute the size of the Latin1 equivalent string,
* From a UTF-16LE/BE string, compute the size of the UTF-8 equivalent string,
* From a UTF-32 string, compute the size of the UTF-8 or UTF-16LE equivalent string,
* From a UTF-16LE/BE string, compute the size of the UTF-32 equivalent string (equivalent to UTF-16 character counting),
* UTF-8 and UTF-16LE/BE character counting,
* UTF-16 endianness change (UTF-16LE/BE to UTF-16BE/LE),
* WHATWG forgiving-base64 (with or without URL encoding) to binary,
* Binary to base64 (with or without URL encoding).
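The size computations in the list above rest on simple byte-level facts about UTF-8, which the following Python sketch illustrates. It assumes the input is already valid UTF-8, and the function names are hypothetical, chosen for this example; they are not taken from the simdutf API, and nothing here reflects how the library implements these operations.

```python
def utf32_length_from_utf8(data: bytes) -> int:
    """Count code points in valid UTF-8 (hypothetical helper).

    Every code point has exactly one byte that is not a continuation
    byte (continuation bytes have the bit pattern 10xxxxxx).
    """
    return sum(1 for b in data if (b & 0xC0) != 0x80)


def utf16_length_from_utf8(data: bytes) -> int:
    """Count UTF-16 code units needed for valid UTF-8 (hypothetical helper).

    Each code point needs one unit, plus one extra unit for every
    4-byte sequence (lead byte >= 0xF0), which becomes a surrogate pair.
    """
    return sum(((b & 0xC0) != 0x80) + (b >= 0xF0) for b in data)


if __name__ == "__main__":
    sample = "a\u00e9\u20ac\U0001F600".encode("utf-8")  # 1-, 2-, 3- and 4-byte sequences
    print(len(sample), utf32_length_from_utf8(sample), utf16_length_from_utf8(sample))
```

Knowing the output size before transcoding lets a caller allocate the destination buffer once, which fits the non-allocating design the description mentions.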

The functions are accelerated using SIMD instructions (e.g., ARM NEON, SSE, AVX, AVX-512, RISC-V Vector Extension, LoongSon, POWER, etc.). When your strings contain hundreds of characters, we can often transcode them at speeds exceeding a billion characters per second. You should expect high speeds not only with English strings (ASCII) but also Chinese, Japanese, Arabic, and so forth. We handle the full character range (including, for example, emojis).

The library compiles down to a few hundred kilobytes. Our functions are exception-free and non-allocating. We have extensive tests and extensive benchmarks.

This package ships the development files.

Maintainer: Mo Zhou <lumin@debian.org>
Changed By: Jeremy Bícha <jbicha@ubuntu.com>
Sponsor: jbicha@debian.org
Distribution: unstable
Architecture: any
VCS: git: https://salsa.debian.org/debian/simdutf.git
Popcon Installs: 5682
Binary NEW: Yes (binary-only upload)
Tracker: https://tracker.debian.org/pkg/simdutf
Uploaded: 4 hours ago

New Package Report

.changes
Date: Wed, 01 Apr 2026 10:16:37 -0400
Source: simdutf
Version: 8.2.0-1
Changed-By: Jeremy Bícha
Architecture: source amd64
Distribution: unstable
Changelog
simdutf (8.2.0-1) unstable; urgency=medium
 .
   * New upstream release
   * Run wrap-and-sort -ast
   * Update library package name for soname bump
.dsc
Package-List:
 libsimdutf-dev deb libdevel optional arch=any
 libsimdutf-tools deb misc optional arch=any
 libsimdutf33 deb libs optional arch=any
Section: libdevel
Priority: optional
Component: main
debian/copyright
Format: https://www.debian.org/doc/packaging-manuals/copyright-format/1.0/
Source: https://github.com/simdutf/simdutf
Upstream-Name: simdutf
Files-Excluded: benchmarks
Comment: The benchmarks are unused and are more complicated to accurately
 document in debian/copyright

Files: *
Copyright: The simdutf authors
License: Apache-2.0 or Expat

Files: cmake/CPM.cmake
Copyright: 2019-2023 Lars Melchior and contributors
License: Expat

Files: cmake/JoinPaths.cmake
Copyright: 2020 Jan Tojnar
License: Expat or CC0-1.0

Files: fuzz/helpers/nameof.hpp
Copyright: 2016-2024 Daniil Goncharov <neargye@gmail.com>
License: Expat

Files: include/simdutf/internal/isadetection.h
Copyright: 2016-     Facebook, Inc            (Adam Paszke)
           2014-     Facebook, Inc            (Soumith Chintala)
           2011-2014 Idiap Research Institute (Ronan Collobert)
           2012-2014 Deepmind Technologies    (Koray Kavukcuoglu)
           2011-2012 NEC Laboratories America (Koray Kavukcuoglu)
           2011-2013 NYU                      (Clement Farabet)
           2006-2010 NEC Laboratories America (Ronan Collobert, Leon Bottou,
                     Iain Melvin, Jason Weston)
           2006      Idiap Research Institute (Samy Bengio)
           2001-2004 Idiap Research Institute (Ronan Collobert, Samy Bengio,
                     Johnny Mariethoz)
License: BSD-3-clause
Comment: Highly modified version of
 https://github.com/endorno/pytorch/blob/master/torch/lib/TH/generic/simd/simd.h

License: Apache-2.0
 Licensed under the Apache License, Version 2.0 (the "License");
 you may not use this file except in compliance with the License.
 You may obtain a copy of the License at
 .
 https://www.apache.org/licenses/LICENSE-2.0
 .
 Unless required by applicable law or agreed to in writing, software
 distributed under the License is distributed on an "AS IS" BASIS,
 WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 See the License for the specific language governing permissions and
 limitations under the License.
 .
 On Debian systems, the complete text of the Apache version 2.0 license
 can be found in "/usr/share/common-licenses/Apache-2.0".

License: BSD-3-clause
 Redistribution and use in source and binary forms, with or without
 modification, are permitted provided that the following conditions are met:
 .
 1. Redistributions of source code must retain the above copyright notice,
 this list of conditions and the following disclaimer.
 .
 2. Redistributions in binary form must reproduce the above copyright notice,
 this list of conditions and the following disclaimer in the documentation
 and/or other materials provided with the distribution.
 .
 3. Neither the names of Facebook, Deepmind Technologies, NYU, NEC Laboratories
 America and IDIAP Research Institute nor the names of its contributors may be
 used to endorse or promote products derived from this software without
 specific prior written permission.
 .
 THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS"
 AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
 IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
 ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE
 LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR
 CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF
 SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS
 INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN
 CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE)
 ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE
 POSSIBILITY OF SUCH DAMAGE.

License: CC0-1.0
 To the extent possible under law, the author(s) have dedicated all
 copyright and related and neighboring rights to this software to the
 public domain worldwide. This software is distributed without any
 warranty.
 .
 You should have received a copy of the CC0 Public Domain Dedication
 along with this software. If not, see
 <https://creativecommons.org/publicdomain/zero/1.0/>.
 .
 On Debian systems, the complete text of the CC0 Public Domain
 Dedication can be found in "/usr/share/common-licenses/CC0-1.0".

License: Expat
 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal
 in the Software without restriction, including without limitation the rights
 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
 copies of the Software, and to permit persons to whom the Software is
 furnished to do so, subject to the following conditions:
 .
 The above copyright notice and this permission notice shall be included in
 all copies or substantial portions of the Software.
 .
 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
 THE SOFTWARE.

Other Reviews of this Package

Version | Hash | Allocated | Completed | Reviewer | Status
8.0.0-1 | 38ee85f9… | 2026-02-07 03:55 | 2026-02-07 04:07 | awm | accepted
