Explain the text indexing
Text indexing is defined as the process of converting a text or text into a list of words. Since a text or ants are given as unstructured forms by themselves or themselves essentially, it is almost impossible to press its raw form directly by using a computer program. In other words, text indexing means the process of segmenting a text which consists of sentences into included words. A list of words is the result of indexing a text as the output of text indexing and will become the input to the text representation.
Let us consider the necessities of text indexing. Text is essentially the unstructured data unlikely the numerical one, so computer programs cannot process it in its raw form. It is impossible to apply numerical operations to texts and is not easy to encode a text into its own numerical value. A text is a too-long string which is different from a short string, so it is very difficult to give the text its own categorical value. Therefore, what is mentioned above becomes the reason for the need to segment an text into words that are short strings as the process of text indexing.
Comments
Post a Comment