
Researchers claim that new software can repair about 90 percent of broken links in the web of data, provided the resources are still on the site's server.
Everyone knows the frustration of following a link to an interesting website only to discover that the target page is no longer there and to be presented with an error page, Iranian researchers said.
The most frustrating of these is the 404 error page. Web users looking for interesting websites or data often land on a 404 page instead, which dampens their enthusiasm for the search. On the web, if a requested resource is not in the directory, the user receives a 404 error. The newly developed algorithm first checks on the back end whether the resource is still available; if it is, the user is redirected to it. Resources that have been manually deleted, however, cannot be recovered.
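The check-then-redirect behaviour described above can be illustrated with a minimal Python sketch. It is not the researchers' code: the function `resolve`, the set `current_paths`, and the mapping `moved` are hypothetical names chosen for illustration, and the sketch assumes the server already knows where a moved resource now lives.

```python
def resolve(path, current_paths, moved):
    """Return an (HTTP status, location) pair for a requested path.

    current_paths: set of paths that exist on the server right now.
    moved: hypothetical mapping from old paths to their new locations.
    """
    if path in current_paths:
        return 200, path          # resource is where the link says it is
    new_path = moved.get(path)
    if new_path in current_paths:
        return 301, new_path      # resource still exists: redirect the user
    return 404, None              # deleted resources cannot be recovered


print(resolve("/people/ali", {"/persons/ali"}, {"/people/ali": "/persons/ali"}))
```

The hard part, which this sketch takes for granted, is discovering the new location in the first place; that is the problem the researchers' algorithm addresses.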
Computing engineers Mohammad Pourzaferani and Mohammad Ali Nematbakhsh of the University of Isfahan explained that previous efforts to address the issue of broken links in the web of data have focused on the destination point.
Their method creates a superior and an inferior dataset that lets them create an exclusive data graph which can be monitored over time in order to identify changes and trap missing links as resources become detached.
"When the broken link is detected the algorithm starts its task to find the new location for detached entity or the best similar candidate for it. To this end, the crawler controller module searches for the superiors of each entity in the inferior dataset, and vice versa. After some steps the search space is narrowed and the best candidate is chosen," said Pourzaferani.
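The search Pourzaferani describes can be sketched roughly in Python. This is an illustration under stated assumptions, not the published algorithm: it assumes that an entity's "superiors" are the entities that link to it and its "inferiors" are the entities it links to, it represents each snapshot as a list of (subject, predicate, object) triples, and it scores candidates by Jaccard overlap of neighbour sets. The names `build_neighbors`, `jaccard`, and `relocate`, along with the sample data, are hypothetical.

```python
from collections import defaultdict


def build_neighbors(triples):
    """Index a snapshot: who links to each entity, and what each entity links to."""
    superiors = defaultdict(set)  # object  -> subjects pointing at it (assumed meaning)
    inferiors = defaultdict(set)  # subject -> objects it points at (assumed meaning)
    for subject, _predicate, obj in triples:
        superiors[obj].add(subject)
        inferiors[subject].add(obj)
    return superiors, inferiors


def jaccard(a, b):
    """Overlap of two sets, from 0.0 (disjoint) to 1.0 (identical)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def relocate(entity, old_triples, new_triples):
    """Guess where a detached entity now lives in the newer snapshot."""
    old_sup, old_inf = build_neighbors(old_triples)
    new_sup, new_inf = build_neighbors(new_triples)
    sup = old_sup.get(entity, set())
    inf = old_inf.get(entity, set())

    # Narrow the search space: only entities that share a neighbour qualify.
    candidates = set()
    for s in sup:                      # what do the old superiors point at now?
        candidates |= new_inf.get(s, set())
    for o in inf:                      # who points at the old inferiors now?
        candidates |= new_sup.get(o, set())
    candidates.discard(entity)
    if not candidates:
        return None

    def score(c):
        return (jaccard(sup, new_sup.get(c, set()))
                + jaccard(inf, new_inf.get(c, set())))

    # sorted() makes tie-breaking deterministic.
    return max(sorted(candidates), key=score)


# Hypothetical snapshots: "ex:Ali" was renamed to "ex:Ali_N".
OLD = [("ex:Univ", "employs", "ex:Ali"),
       ("ex:Ali", "bornIn", "ex:Isfahan"),
       ("ex:Ali", "field", "ex:CS")]
NEW = [("ex:Univ", "employs", "ex:Ali_N"),
       ("ex:Ali_N", "bornIn", "ex:Isfahan"),
       ("ex:Ali_N", "field", "ex:CS"),
       ("ex:Univ", "employs", "ex:Sara"),
       ("ex:Sara", "bornIn", "ex:Tehran")]

print(relocate("ex:Ali", OLD, NEW))  # ex:Ali_N
```

Both "ex:Ali_N" and "ex:Sara" enter the candidate set because they share a superior with the missing entity, but only "ex:Ali_N" also shares its inferiors, so it scores highest.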
To demonstrate their algorithm, the engineers tested it on two DBpedia snapshots, each containing approximately 300,000 person entities. Results showed that the algorithm was able to identify about 5,000 entities between the two snapshots and successfully relocated 9 out of 10 of the broken links. The details are reported in the International Journal of Web Engineering and Technology (Inderscience).