
Suggesting lookup list entries #255

@jhoetter


Is your feature request related to a problem? Please describe.
I want to quickly extend my lookup lists with additional values, including values from records that I haven't labeled yet.

Describe the solution you'd like
With a token-level embedding, we should be able to compute n-grams (see below for more context) and run a similarity search against the entries we already have. That way, we could find synonyms and related terms in the corpus we have at hand, which could be very helpful.

Again, this could be triggered on request by pressing a button in the lookup list, which then runs the similarity search and creates suggestions.
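
A rough sketch of how the suggestion step might look, to make the idea concrete. Everything here is an assumption for illustration: `suggest_entries`, `embed`, and the toy 2-d vectors in `TOY_VECTORS` are hypothetical, and mean-pooling token vectors into an n-gram vector is just one simple choice, not something this issue prescribes.

```python
import math

def mean_pool(vectors):
    """Average token vectors into a single n-gram vector."""
    dim = len(vectors[0])
    return [sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]

def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def suggest_entries(records, embed, lookup_list, max_n=2, top_k=3, threshold=0.9):
    """Suggest n-grams from tokenized records that are similar to existing entries.

    records: list of token lists; embed: hypothetical token -> vector function.
    """
    known = {entry.lower() for entry in lookup_list}
    entry_vecs = [
        mean_pool([embed(tok) for tok in entry.lower().split()])
        for entry in lookup_list
    ]
    best = {}  # candidate text -> best similarity to any existing entry
    for tokens in records:
        tokens = [tok.lower() for tok in tokens]
        for n in range(1, max_n + 1):
            for i in range(len(tokens) - n + 1):
                gram = tokens[i:i + n]
                text = " ".join(gram)
                if text in known:
                    continue  # already in the lookup list
                vec = mean_pool([embed(tok) for tok in gram])
                score = max(cosine(vec, ev) for ev in entry_vecs)
                if score >= threshold and score > best.get(text, 0.0):
                    best[text] = score
    return sorted(best.items(), key=lambda kv: kv[1], reverse=True)[:top_k]

# Toy embedding table (made-up values, for illustration only).
TOY_VECTORS = {
    "berlin": [1.0, 0.1], "munich": [0.9, 0.2], "paris": [0.95, 0.15],
    "i": [0.1, 0.9], "visited": [0.0, 1.0], "and": [0.0, 0.8],
}

def embed(token):
    return TOY_VECTORS.get(token, [0.0, 0.0])

suggestions = suggest_entries(
    records=[["I", "visited", "Munich", "and", "Paris"]],
    embed=embed,
    lookup_list=["Berlin"],
)
for text, score in suggestions:
    print(f"{text}: {score:.3f}")
```

In a real setup the `embed` function would presumably be backed by the project's token-level embeddings, and the threshold and `top_k` would need tuning per lookup list.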

Describe alternatives you've considered
-

Additional context
Google search for n-grams

An n-gram is a sequence of n words: a 2-gram (which we'll call a bigram) is a two-word sequence of words like “please turn”, “turn your”, or “your homework”, and a 3-gram (a trigram) is a three-word sequence of words like “please turn your”, or “turn your homework”.
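
As a small illustration of the definition above, n-grams can be extracted from a token list with a sliding window. This is a minimal sketch; the helper name `ngrams` and the whitespace tokenization are assumptions, not part of any existing code.

```python
def ngrams(tokens, n):
    """Return all contiguous n-token sequences as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

tokens = "please turn your homework".split()
bigrams = ngrams(tokens, 2)
trigrams = ngrams(tokens, 3)

print(bigrams)
print(trigrams)
```

For the four-token example, this yields three bigrams and two trigrams, matching the phrases in the quoted definition.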

Labels: enhancement (New feature or request)