Explain the challenges of mining used in WWW.
Challenges in Web Mining
The web poses great challenges for resource and knowledge discovery based on the following observations:
The web is too huge: The size of the web is very huge and rapidly increasing. The seems that the web is too huge for data warehousing and data mining.
The complexity of web pages: The web pages do not have a unifying structure. They are very complex as compared to traditional text documents. There is a huge number of documents in the digital library of the web. These libraries are not arranged according to ar particular sorted order.
The web is a dynamic information source: The information on the web is rapidly updated. The data such as news, stock markets, weather, sports, shopping, etc., are regularly updated.
Diversity of user communities: The user community on the web is rapidly expanding. These users have different backgrounds, interests, and usage purposes. There are more than 100 million workstations that are connected to the internet and still rapidly increasing.
Relevancy of Information: It is considered that a particular person is greatly interested in an only small portion of the web, while the rest of the portion of the web contains information that is not relevant to the user and may swamp desired results
Comments
Post a Comment