Per Wookieepedia sourcing policy, Rule 10, and since the Mofference from March 6, 2016, and the Consensus Track of May 2, 2017:
When citing an external link, a permanent archival link must also be included in the citation template if available. Use Internet Archive's Wayback Machine for this purpose. If Wayback Machine's "Save Page Now" feature does not work for a specific case, use archive.today instead.
As a small reminder as to why this rule is really important, consider that every internet page cited on Wookieepedia could (and some already have) be deleted at any moment, if not just having their url modified (aka starwars.com favorite sport). We need archive links (aka backup) to preserve the sustainability of attribution and verifiability principles, which are the pillars of any respectable encyclopedia.
However, this decision created an unavoidable backlog of maintenance work, and in September 2017, an automated category was created to list every pages needing the addition of archive link(s), Category:Pages with missing permanent archival links. As of today, more than 15 000 articles are still listed here.
With the creation of this topic, I hope to increase awareness about archive links for as many editors as possible, in the hope that we could work together toward the completion of this essential maintenance work. It is my belief that a concentrated effort could tackle this task in less than a year. Below, you'll find useful tolls, as well as tips. The topic will be updated when necessary. Feel free to ask any question regarding missing archive links in the Comments section below.
Contents
The task[]
- You can organize your work between "regular" edits and missing archive edits, as I do. No need for anyone to focus only on this, as long as some work is done. A page with a single missing link (like File page) take up to 30sec to edit, if not less, and is perfect for the least experienced among us. A medium size page, between 2 to 5 minutes. Large pages can need between 15 to 30 minutes of work, but a large page doesn't always mean lot of work.
- Either choose a page from the category list, or whenever you spot the tag (in the category list at the end of an article) on a page.
- Use the Category:Internet citation templates to find what is required by the template. You'll find that some need only the "archiveurl=" parameter, and others only "archivedate=". The Cite web template needs both parameters.
- To find an archive link, you have two possibility:
- Use our centralized backup link repository, and with a simple Ctrl=F (the search function), you'll be able to see if a backup link is already listed for the next bot run.
- Use Internet Archive, insert the url you want to find a link for in the Wayback Machine. Copy the url of the page you then accessed (the most recent archive) and put it in "archiveurl" parameter. If there is none available, and the page you want to archive is still live, then proceed to create the archive yourself, as proposed by the Wayback Machine.
- WARNING: Some sites changed their url over time, like the official Star Wars website. See the Tips section below to find way to reach a live version of a page.
- Example: Let's say I want to make an archive link of Wookieepedia:Sourcing (https://starwars.fandom.com/wiki/Wookieepedia:Sourcing), Internet Archive will provide this url: https://web.archive.org/web/20200120123734/https://starwars.fandom.com/wiki/Wookieepedia:Sourcing. The whole url needs to be put in the "archiveurl" parameter. The number between https://web.archive.org/web/ and /https://starwars.fandom.com/wiki/Wookieepedia:Sourcing, 20200120123734, is to be inserted in the "archivedate" parameter.
- If there is no way to provide an archive link, as it is for EA Battlefront (2015) official website, LinkedIn profiles, and some news sites out there who block/mislead bots, and if you're absolutely sure (after checking if archive.today can handle it, which sometimes works), you can insert "nobackup=1" parameter at the end of some internet citation template. Make sure to check if it works, as not all template supports this parameters, and the documentation sometimes don't mention it at all. If the website is still live, please provide a screenshot for reference, and make sure to add it to Wookieepedia:Website screenshots.
- DO NOT do this for Star Wars ({{SW}}) pages, as there is probably a way to find it.
- Please add all backup links to the centralized backup link repository. This will enable EcksBot to add backup links to all pages using the same citation template. Please read the instructions on the page carefully.
Tips[]
This section will be updated, don't hesitate to share your tips in the Comments section, and it we be added there.
- General
- On large article, don't hesitate to open the "Appearances" or "Sources" in the Source edit mode to easily spot if a missing link is located there, as the tag will appear at the bottom of the preview.
- Always keep open a live version of a page you're editing, as it will be much easier to access links that way, since we often cut link in template (ex: Amazon).
- When clicking on a link on Wookieepedia, you might end up elsewhere because of a redirect. To avoid this, right click on the link on copy the url directly on Internet Archive, or copy it directly from the editing page.
- Archive Internet: The site is not without bugs and quirks.
- Sometimes, after creating an archive, the wayback machine will not display it. It best to close the page and wait to try later, as it will block any other attempt on the same page for 10 minutes.
- When using the search while on an archived page, the archive can mix things up and display an error message. When this happen, make sure to check the archive calendar using https://web.archive.org/web/*/ (Example: https://web.archive.org/web/*/https://starwars.fandom.com/wiki/Wookieepedia:Sourcing)
- Be careful about redirects, as the site also archive them, and they can hinder your search. See the previous tip on how to avoid this.
- When using the archive calendar, always go for dates in blue, as other colors will link to error page and redirect.
- File page bug: Updating a File page (image) on Wookieepedia while adding a link with "https://" will trigger an error page when trying to save your edit. This can be avoided by replacing "https://" with "{{subst:https}}".
- StarWars.com: The official website has changed its url structure a few times along the years.
- If you are working on pages related to The Clone Wars, be sure to check the The TCW episode guide dilemma, to properly understand the conundrum between the various version of Episode Guide.
- Most of the deleted pages are listed on Wookieepedia using the Blog or SWArchive template.
- If faced with an error message, try a simple search with the name of the article you're trying to find, more often than not, you'll find it.
- If you see that the url ends with ".html", then you know it's an old one. Sometimes, just removing it will let you access the live version.
- Some pages will have redirect from the old url to the new one (and sometimes it won't, don't ask...), please update the link on Wookieepedia accordingly.
- List of known partial url modification (old > new):
- tv-shows/ > series/
Organization[]
As I know I'm already not alone in working on missing archive links, it seems important for those among us who work on medium or large pages to organize to avoid hinder each other's. Something like that could happen if we all follow the same work logic, like for example, tackle the category list in alphabetical order, which would inevitably lead to people working on the same page at some point. To avoid this, and thanks to the table of content on Category:Pages with missing permanent archival links, I propose that those interested in doing the heavier work choose each a letter to work on, and add their name here:
- NanoLuukeCloning facility 14:44, March 17, 2020 (UTC)
- I've handled adding random missing backup links I've come across for a while now, happy to do it for a project. Fan26 (Talk) 14:47, March 17, 2020 (UTC)
- Been doing this for a few weeks, ain't stopping now. UberSoldat93 (talk) 15:33, March 17, 2020 (UTC)
- 1358 (Talk) 18:23, March 17, 2020 (UTC)
- 01miki10 Open comlink 20:42, March 19, 2020 (UTC)
- Zed42 (talk) 00:07, March 22, 2020 (UTC)
- Master
Fredcerique 05:52, March 23, 2020 (UTC)
- Mr Star Wars Amino Republic (talk) 20:13, June 2, 2020 (UTC)
OOM 224 ༼༽{talk}༼༽ 21:01, June 17, 2020 (UTC)
List of all unique instance of some templates, so it easier to find and work on than to rummage in the articles:
Progress[]
Here's a little demonstration of the amazing progress we've made so far:
|
We are currently working on 111 pages with missing permanent archival link(s). Keep up the good work, everyone!
Comments[]
NB: If you want a fast response, don't hesitate to reach out the Wookieepedia IRC.
- We really need to change any references to WebCite on policy and project pages, as they no longer create new archives. (However, old archives are available for the time being.) We can probably safely replace it with archive.today (aka archive.is / archive.fo / archive.vn, etc.). -- Darth Culator (Talk) 15:05, March 17, 2020 (UTC)
- This is an excellent initiative. I encourage everyone participating to add their links to the centralized backup repository; adding backup links is a perfect task for bots. 1358 (Talk) 18:23, March 17, 2020 (UTC)
- As of this morning, I have added the automatic category to {{SWE}} and {{DB}} as well, which means that we are up to more than 17,000 pages with missing permanent archival links. 1358 (Talk) 11:09, March 22, 2020 (UTC)
The TCW episode guide dilemma[]
OK, so here's a problem: starwars.com is, as everyone knows, a huge mess. Those familiar with TCW episode guides will know that there have been many versions over the years:
Template | Years active | Legends | Canon | List on Legends articles | List on Canon articles |
![]() |
2008–September 12, 2011 | Yes | Yes* | Yes | Yes |
![]() |
September 13, 2011–June 30, 2014 | Yes | Yes* | Yes | No |
![]() |
July 1, 2014–current | No | Yes | No | Yes |
Currently we are treating both number 1 and number 3 as Canon seeing as the original episode guides were adapted directly from the script (example: Marg Sabl) and IMO there's no reason we shouldn't treat number 2 as Canon as well by that logic. The good news is that number 2 provides pretty much no information (starwars.com went through its "completely useless" phase) and number 3 is pretty much just a continuation/update of number 2. So for Canon articles, I think it's pretty simple: list both number 1 and 3, replace any instances of number 2 with 3. The problem is how we handle Legends articles. So far people participating in this project have been replacing number 2 with 3 on Legends articles as well, and this is a problem because number 3 was published after the Canon decision and is thus not Legends. The good news is that number 1 is by far the most comprehensive version, so it's not like we've Legends'ified any information by including number 3 in Legends articles. As far as I am concerned, this is how we should be handling this:
- Canon articles: List number 1 (sorting date 2011-09-12; to my knowledge this is the oldest Sources entry you will find on Canon articles, so it goes first in the list) and number 3 (sorting date 2011-07-01, after {{SWArchive}} and {{SWE}}, or publication date for newer TCW episodes).
- Legends articles: List number 1 (sorting date by episode guide publication date, usually the episode publication date) and number 2 (sorting date 2011-09-12 or when the episode guide was published (for episodes oublished later than 2011-09-12)).
Thoughts? 1358 (Talk) 23:59, March 21, 2020 (UTC)
- Good you thought about this (and that's why experience is important, folks ^^). Too bad we already changed some. If I can resume, the most important thing for us here is to make sure we keep Legends page with url styled "explore/" and Canon with url styled "series/" (and "tv-shows/", that needs to be updated each time it's spotted), and don't modify SWArchive with Episode Guide. So please, don't go and and add link that would changes explore/ to series/ in the repository. If needed (for example: a Canon pages with url styled explore/), do it manually. --NanoLuukeCloning facility 12:33, March 22, 2020 (UTC)
DB,SWE, TORweb, and SWArchive[]
As of this morning, I have added the automatic categorization feature to {{DB}} and {{SWE}}, which has resulted in a bump from under 14,000 articles to around 18,000 articles. This can feel a bit demotivating, but this should actually be a fairly quick endeavor. In an effort to make things easier, I have created a list of all unique instances of the templates on the wiki (lists moved in Organization).
Going through these lists and adding links to the centralized repository should be fairly trivial. The rendered lists can be used for easy access to a backup link that usually works. 1358 (Talk) 11:33, March 22, 2020 (UTC)