unicodestrip

package
v0.0.0-...-f288654 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Nov 10, 2024 License: MIT Imports: 5 Imported by: 0

Documentation

Overview

Package unicodestrip provides a Bleve keyword filter which decomposes unicode strings.

Index

Constants

View Source
const Name = "strip_unicode"

Variables

This section is empty.

Functions

func Constructor

func Constructor(config map[string]interface{}, cache *registry.Cache) (analysis.TokenFilter, error)

Constructor provides a constructor for Bleve.

Types

type StripUnicodeFilter

type StripUnicodeFilter struct {
}

StripUnicodeFilter is a Bleve keyword filter which decomposes unicode strings into their normalized form and strips away non-spacing marks. Effectively, this strips away diacritic marks so that searches may be done without entering them, e.g. "Télépopmusik" is indexed as "Telepopmusik".

func New

func New() (*StripUnicodeFilter, error)

New returns a new instance of StripUnicodeFilter.

func (*StripUnicodeFilter) Filter

Filter removes non-spacing marks from text in a token stream.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL