{"id":2968,"date":"2018-10-06T18:21:45","date_gmt":"2018-10-06T18:21:45","guid":{"rendered":"http:\/\/www.styledeals.co.uk\/blog\/wikipedias-broken-links-fixed-by-the-internet-archive\/"},"modified":"2018-10-06T18:21:45","modified_gmt":"2018-10-06T18:21:45","slug":"wikipedias-broken-links-fixed-by-the-internet-archive","status":"publish","type":"post","link":"https:\/\/www.styledeals.co.uk\/blog\/wikipedias-broken-links-fixed-by-the-internet-archive\/","title":{"rendered":"Wikipedia&#8217;s broken links fixed by the Internet Archive"},"content":{"rendered":"\n<div property=\"articleBody\">\n<figure class=\"media-landscape no-caption full-width lead\"><span class=\"image-and-copyright-container\"><\/p>\n<p>                <img loading=\"lazy\" decoding=\"async\" class=\"js-image-replace\" alt=\"Wikipedia broken link\" src=\"https:\/\/ichef.bbci.co.uk\/news\/320\/cpsprodpb\/16F6\/production\/_103687850_f92c8b3b-ab84-4c59-aca5-aa01079bd1a1.jpg\" width=\"976\" height=\"549\"\/><span class=\"off-screen\">Image copyright<\/span><br \/>\n                 <span class=\"story-image-copyright\">Getty Images<\/span><\/p>\n<p>            <\/span><\/p>\n<\/figure>\n<p class=\"story-body__introduction\">Nine million broken Wikipedia links have been fixed thanks to an alliance with the Internet Archive.<\/p>\n<p>The online encyclopedia&#8217;s editors have long been encouraged to provide links to web-based sources. But the details can be lost if the third-party sites close or update their pages.<\/p>\n<p>To address this, visitors are now pointed to snapshots of what the sites used to show, when required.<\/p>\n<p>It has also emerged that some editors voted to <a href=\"https:\/\/en.wikipedia.org\/wiki\/Wikipedia:Reliable_sources\/Noticeboard\/Archive_248#RfC:_Breitbart\" class=\"story-body__link-external\">restrict use of Breitbart<\/a>.<\/p>\n<p>A post on Wikipedia&#8217;s reliable sources page states that there was a &#8220;very clear consensus&#8221; that the right-wing news site should stop being used as a source for facts &#8220;due to its unreliability&#8221;. <\/p>\n<p>It suggested that Breitbart could still, however, be used to attribute viewpoints.<\/p>\n<p>But some editors were concerned by the idea.<\/p>\n<p>&#8220;Breitbart should be used with caution &#8211; but an outright ban on citing it would hurt Wikipedia far more than help,&#8221; wrote one.<\/p>\n<p>The BBC has contacted Breitbart for a response.<\/p>\n<p>It follows a similar move against the Daily Mail last year.<\/p>\n<p>Volunteers were subsequently encouraged to review existing Wikipedia citations of the UK newspaper and either remove or replace them.<\/p>\n<p><a href=\"https:\/\/motherboard.vice.com\/en_us\/article\/pa9qvv\/wikipedia-banned-breitbart-infowars\" class=\"story-body__link-external\">The Motherboard news site<\/a> has reported that Wikipedia editors have also advocated similar limits on the use of articles by the left-wing Occupy Democrats organisation and the conspiracy-theory media platform InfoWars.<\/p>\n<h2 class=\"story-body__crosshead\">Link rot<\/h2>\n<p>The collaboration with the Internet Archive makes use of the San Francisco&#8217;s based project&#8217;s Wayback Machine tool.<\/p>\n<p>This allows users to enter a web address and then find stored versions of how a page appeared on dates in the past.<\/p>\n<p>The non-profit said it had begun using automated software three years ago to hunt out links that resulted in &#8220;page not found&#8221; or &#8220;404&#8221; and &#8220;500&#8221; errors. <\/p>\n<p>This bot then searched the Wayback Machine for the relevant information and automatically updated the links.<\/p>\n<p>This, the archive&#8217;s director said, had resulted in six million pages lost to &#8220;link rot&#8221; being restored.<\/p>\n<figure class=\"media-landscape has-caption full-width\"><span class=\"image-and-copyright-container\"><\/p>\n<p>                 <span class=\"off-screen\">Image copyright<\/span><br \/>\n                 <span class=\"story-image-copyright\">Internet Archive<\/span><\/p>\n<p>            <\/span><figcaption class=\"media-caption\"><span class=\"off-screen\">Image caption<\/span><br \/>\n                <span class=\"media-caption__text\"><br \/>\n                    The bot was designed to seek out and fix broken links<br \/>\n                <\/span><br \/>\n            <\/figcaption><\/figure>\n<p>Mark Graham added that members of the Wikipedia community had also helped tackle a related issue &#8211; &#8220;content drift&#8221;.<\/p>\n<p>This occurs when a page remains online but its text and images change so that they no longer resemble what the editor who linked to them had intended.<\/p>\n<p>These human volunteers had fixed more than three million links to date, the director wrote.<\/p>\n<p>&#8220;We will expand our efforts to check and edit more Wikipedia sites and increase the speed which we scan those sites and fix broken links,&#8221; Mr Graham concluded, adding that he also intended to explore whether Wikipedia&#8217;s contributors could be encouraged to use Wayback Machine snapshots in the first place rather than live-web links.<\/p>\n<\/p><\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/www.bbc.co.uk\/news\/technology-45730363\">Source<\/a> by <a href=\"\">[author_name]<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Image copyright Getty Images Nine million broken Wikipedia links have been fixed thanks to an alliance with the Internet Archive. The online encyclopedia&#8217;s editors have long been encouraged to provide links to web-based sources. But the details can be lost if the third-party sites close or update their pages. To address this, visitors are now &hellip; <\/p>\n","protected":false},"author":0,"featured_media":2969,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-2968","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-general"],"_links":{"self":[{"href":"https:\/\/www.styledeals.co.uk\/blog\/wp-json\/wp\/v2\/posts\/2968","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.styledeals.co.uk\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.styledeals.co.uk\/blog\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/www.styledeals.co.uk\/blog\/wp-json\/wp\/v2\/comments?post=2968"}],"version-history":[{"count":0,"href":"https:\/\/www.styledeals.co.uk\/blog\/wp-json\/wp\/v2\/posts\/2968\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.styledeals.co.uk\/blog\/wp-json\/wp\/v2\/media\/2969"}],"wp:attachment":[{"href":"https:\/\/www.styledeals.co.uk\/blog\/wp-json\/wp\/v2\/media?parent=2968"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.styledeals.co.uk\/blog\/wp-json\/wp\/v2\/categories?post=2968"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.styledeals.co.uk\/blog\/wp-json\/wp\/v2\/tags?post=2968"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}