What do you mean by web mining? Discuss the types of web mining.

 WEB MINING

Web mining aims to discover useful information or knowledge from the Web hyperlink structure, page content, and usage data. Although Web mining uses many data mining techniques, as mentioned above it is not purely an application of traditional data mining due to the heterogeneity and semi-structured or unstructured nature of the Web data. Many new mining tasks and algorithms were invented in the past decade. Based on the primary kinds of data used in the mining process,

Web mining tasks can be categorized into three types: Web content mining, Web structure, Web usage mining, Web Mining



Web Content Mining: Web content mining extracts or mines useful information or knowledge from Web page content. Web content mining focuses on the content of the Web pages rather than t links. For example, we can automatically classify and cluster Web pages according to their topic Web content is a very rich information resource consisting of many types of information, for example, unstructured free text, image, audio, video, animation, and metadata as well as hyperlinks. A variety of techniques are therefore needed to retrieve the content of interest. The content of web pages includes no machine-readable semantic information. Search engines, subject directories, intelligent agents, cluster analysis, and portals are employed to find what a user might be looking for. It has been suggested that users should be able to pose more sophisticated queries than just specifying the keywords.

Web Structure Mining: Web structure mining discovers useful knowledge from hyperlinks (or lin for short), which represent the structure of the Web. For example, from the links, we can discover important Web pages, which, incidentally, is a key technology used in search engines. We can all discover communities of users who share common interests. Research has been carried out to mod the Web-based on the topology of the hyperlinks. Traditional data mining does not perform such tasks because there is usually no link structure in a relational table.

Web Usage Mining: Web usage mining primarily deals with understanding user behavior interacting with the Web or with a website. One of the aims is to obtain information that may ass website re-organization or assist site adaptation to better suit the user. The mined data often include data logs of users' interactions with the Web. The logs include the Web server logs, proxy server lo, and browser logs. The logs include information about the referring pages, user identification, time user spends at a site and the sequence of pages visited. Information is also collected via cookie file While Web structure mining shows that page A has a link to page B, Web usage mining shows wh or how many people took that link, which site they came from, and where they went when they left the page B.

The three categories above are not independent since Web structure mining is closely related to Web content mining and both are related to Web usage mining.

Comments

Popular posts from this blog

What are different steps used in JDBC? Write down a small program showing all steps.

Discuss classification or taxonomy of virtualization at different levels.

Pure Versus Partial EC