Glossary of Search Terms

Glossary of search terms

January 2023

Absolute boosting

Ensuring that a specified document always appears at the same point in a results set, or always appears on the first page of results

Access control list (ACL)

Defines access permissions at a user or group level (often based on Active Directory) to specific repository, a set of documents, or a section of a document

Advanced search

The provision of a search user interface which prompts the user to enter additional terms to assist in retrieving results, often using Boolean operators.

Apache

The Apache Foundation provides support for a wide range of open source applications, including Lucene and Solr

Appliance

A search application pre-installed on a server ready for insertion into a standard server rack

Aggregated search

The presentation of related content items (often referred to as verticals) from a single index in a specific area of a page of search results

Artificial intelligence

A set of technologies that enable machines to sense, comprehend, act and learn in a manner that seeks to emulate a human response to a situation

Auto-categorization

An automated process for creating a classification system (or taxonomy) from a collection of nominally related documents

Auto-classification

An automated process for assigning metadata or index values to documents, usually in conjunction with an existing taxonomy

Average response time

An average of the time taken for the search engine to respond to a query, or the average end-to-end time of a query

BERT

Bidirectional Encoder Representations from Transformers (BERT) is a machine learning technique which enhances the performance of training based on natural language processing.

Best bets

Results that are selected to appear at the top of a list of results that provide a context for other documents generated and ranked by the search application

BM25 (Best Match 25)

A ranking algorithm developed in the 1990s of which there are now multiple variants. It has its origins in the tf.idf ranking function and is widely used as the basis for enterprise search applications

Boolean Operators

A widely used approach to create search queries; examples include And, OR, and NOT—for example, information AND management

Boolean search

A search query using Boolean operators

Boosting

Changing search ranking parameters to ensure that certain documents or categories of documents appear higher in the results than the raw algorithm would suggest.

Chatbot

A chatbot application is able to conduct a voice query against a search index in lieu of providing direct contact with (for example) a call-centre operator

Categorization

The placing of boundaries around objects that share similarities (e.g., taxonomy)

Clustering

A process employed to generate groupings of related words by identifying patterns in a document index

Cognitive search

A description loosely applied by search vendors to applications using machine learning and AI techniques to determine the work context of the user and deliver personalized results

Collection

A group of objects methodically sorted and placed into a category

Computational linguistics

The use of computer-based statistical analysis of language to determine patterns and rules that aid semantic understanding

Concept extraction

The process of determining concepts from text using linguistic analysis

Connector

A software application that enables a search application to index content in another application

Controlled vocabulary

An organized list of words, phrases, or some other set employed to identify and retrieve documents

COTS

Commercial off-the-shelf software

Conversational search

Conversational search applications respond to a spoken request or query with a spoken response. See also Chat Bot.

Crawler

A program used to index documents

Cross-language search

A query in one language is translated into other indexed languages (often using a multi-lingual thesaurus) so that all documents relevant to the concept of the query are returned no matter what language is used for the content

Deep learning

Deep learning builds on machine learning principles but makes use of artificial neutral networks to be able to manage very large collections of data with real-time responses

Description

A brief summary, often generated automatically, that provides a description of a document in the list of results

Glossary of Search Terms

Glossary of search terms

January 2023

Get in touch

Our Address

Follow us