Why text analytics is so important in search

A Search Engine's Best Friend: Text Analytics
The sophistication of search tech continues to grow

Choosing the right keywords for search is the most important component of getting the results you're looking for. Everyone knows this, but it's easier said than done. Even with the most well thought out keywords, search results don't always deliver what you're expecting.

Improving the accuracy of search is of utmost importance to companies like Google and Yahoo, and one of the best ways to do this is to incorporate text analytics (AKA text mining) into the back end.

Let's take a typical enterprise search engine and break down the steps that go into an actual search. First, a database of unstructured content is fed into a pipeline, where it is converted into a structured document. That document is then fed into an index, and when a person queries the index, results appear.

Text analytics occurs within the pipeline, before the content is indexed, where it analyses the content and extracts meaningful metadata such as entities being discussed, sentiment, and themes.

The information gained from the text mining process can then be used to create a more efficient search. A common tool for this purpose is faceted search. Any time you've used an advanced search option while using a search engine, you've been using faceted search. It is particularly useful because it enables cross-referencing through all of that metadata.

Faceted search engines come in a variety of complexities and flavours. Major retail websites use rudimentary faceted search to narrow down the categories in which you are searching, while databases such as ones for academic or legal documents may have a more complex set of cross referencing tools.

Text analytics is crucial for word sense disambiguation. Word sense disambiguation is the process of determining what meaning of a word that has multiple definitions is being used in a sentence.

In a typical string based search engine, a search for a term with multiple definitions is going to yield results for all possible uses of the word. Using text mining, the context of the rest of the sentence or phrase in which the word is located is used to determine what the word refers to, when that knowledge is applied to search, it improves the relevance of search results.

More than anything, text mining's power in search is that it allows you to ask more general questions like "who's hot and who's not?" and "is there any breaking news I need to know?" and get results that actually answer those questions.

All in all, the ability to add context and extract metadata from unstructured content before it is indexed makes search engines a far more powerful tool.

  • Mekkin is the marketing manager at Lexalytics, a company that specialises in text mining and sentiment analysis technology.
Latest in Pro
Branch office chairs next to a TechRadar-branded badge that reads Big Savings.
This office chair deal wins the Amazon Spring Sale for me and it's so good I don't expect it to last
Saily eSIM by Nord Security
"Much more than just an eSIM service" - I spoke to the CEO of Saily about the future of travel and its impact on secure eSIM technology
NetSuite EVP Evan Goldberg at SuiteConnect London 2025
"It's our job to deliver constant innovation” - NetSuite head on why it wants to be the operating system for your whole business
FlexiSpot office furniture next to a TechRadar-branded badge that reads Big Savings.
Upgrade your home office for under $500 in the Amazon Spring Sale: My top picks and biggest savings
Beelink EQi 12 mini PC
I’ve never seen a PC with an Intel Core i3 CPU, 24GB RAM, 500GB SSD and two Gb LAN ports sell for so cheap
cybersecurity
Chinese government hackers allegedly spent years undetected in foreign phone networks
Latest in News
Open AI
OpenAI live stream - could we see a major ChatGPT upgrade?
Apple WWDC 2025 announced
Apple just announced WWDC 2025 starts on June 9, and we'll all be watching the opening event
Hornet swings their weapon in mid air
Hollow Knight: Silksong gets new Steam metadata changes, convincing everyone and their mother that the game is finally releasing this year
OpenAI logo
OpenAI just launched a free ChatGPT bible that will help you master the AI chatbot and Sora
NetSuite EVP Evan Goldberg at SuiteConnect London 2025
"It's our job to deliver constant innovation” - NetSuite head on why it wants to be the operating system for your whole business
Monster Hunter Wilds
Monster Hunter Wilds Title Update 1 launches in early April, adding new monsters and some of the best-looking armor sets I need to add to my collection