This document discusses text and web mining. It defines text mining as analyzing huge amounts of text data to extract information. It discusses measures for text retrieval like precision and recall. It also covers text retrieval and indexing methods like inverted indices and signature files. Query processing techniques and ways to reduce dimensionality like latent semantic indexing are explained. The document also discusses challenges in mining the world wide web due to its size and dynamic nature. It defines web usage mining as collecting web access information to analyze paths to accessed web pages.
Related topics: