With the web being flooded with information about every topic, the problem of
identifying the correct information from the websites poses a challenge to the
researchers. The end user, who searches for information about a topic, depends on the
ranking of the websites provided by search engines. These rankings are generally based
on the structure of the website (centrality) or the page hits (HITS/Page Rank). The
content of the websites are also considered as keywords (Tf-Idf). Even with the inclusion
of all these factors, the underlying problem of finding genuine information still holds.
Even some top-ranked websites provide information which is not correct, while some
less popular websites, ranked lower on the famous search engines, are found to provide
more genuine and correct information about a user’s query.
According to Mashable, in a survey conducted by Harris Interactive in 2012, 98% of
Americans distrust information found on the Internet, with 94% saying, “bad things can
happen as a result of acting on inaccurate information online” (Mancx Survey, 2012).
|