Name	Name	Last commit message	Last commit date
parent directory ..
src	src
test	test
CHANGELOG.md	CHANGELOG.md
README.md	README.md
eslint.config.mjs	eslint.config.mjs
package.json	package.json
tsconfig.json	tsconfig.json
tsconfig.lib.json	tsconfig.lib.json
tsconfig.spec.json	tsconfig.spec.json
vite.config.ts	vite.config.ts

Name

Last commit message

Last commit date

@lde/text-normalization

Zero-dependency text folding for search index and query normalization.

fold() produces a diacritic- and case-insensitive form of a string, applied identically at index time and query time so that a search index never diverges from the queries run against it (divergence = silent search misses).

import { fold } from '@lde/text-normalization';

fold('Møhlmann'); // 'mohlmann'
fold('Coöperatieve'); // 'cooperatieve'
fold('Straße'); // 'strasse'

It combines Unicode NFKD decomposition + combining-mark stripping (which folds é, ö, å, ç, …) with an explicit transliteration map for letters that do not decompose under NFKD (ø, æ, œ, ß, ð, þ, ł, đ, …).

When it’s needed

A search engine on its default locale often folds case and diacritics for you – Typesense v30 (verified) even folds the non-decomposing ø/æ/ß – so on the default locale fold() is redundant for search. It becomes necessary when:

Sorting – engines sort strings by raw code-point order with no collation, so a fold()-ed companion field is the only way to sort case- and diacritic-insensitively.
Stemming – enabling a language’s stemmer requires a non-default locale, which switches the tokenizer (Typesense → ICU) to one that preserves diacritics; the default folding is lost, and fold() restores diacritic-insensitive matching.

fold() is idempotent (fold(fold(x)) === fold(x)). Punctuation and word boundaries are preserved; tokenization is left to the search engine.

Because folded values are stored in the search index, the same fold() must be used at index time and query time, and any change to it requires a full rebuild.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

README.md

@lde/text-normalization

When it’s needed

Uh oh!

FilesExpand file tree

text-normalization

Directory actions

More options

Directory actions

More options

Latest commit

History

text-normalization

Folders and files

parent directory

README.md

@lde/text-normalization

When it’s needed