![]() |
Wikimedia engineering report, June 2013 | July 2013![]() |
Engineering metrics in June:
- 116 unique committers contributed patchsets of code to MediaWiki.
- The total number of unresolved commits remained stable around 960.
- About 30 shell requests were processed.
- Wikimedia Labs now hosts 166 projects and 1520 users; to date 2052 instances have been created.
- The tools project in Labs now hosts 222 tools and 193 members.
Major news in June include:
- The preparation for the activation of VisualEditor to most Wikipedia sites, and its debut on the English Wikipedia;
- News around Language engineering, including the preparation and activation of the Universal Language Selector on many wikis;
- An explanation of how bugs are discovered and fixed.
- A retrospective on the Amsterdam hackathon.
Note: We're also providing a shorter, simpler and translatable version of this report that does not assume specialized technical knowledge.
Upcoming events
There are many opportunities for you to get involved and contribute to MediaWiki and technical activities to improve Wikimedia sites, both for coders and contributors with other talents.
For a more complete and up-to-date list, check out the Project:Calendar.
Date | Type | Event | Contact |
---|---|---|---|
3 July 2013 | ![]() |
Tech Talk: Solr based search | Quim Gil |
11 July 2013 | ![]() |
WMF Metrics and Activities meeting | Erik Möller |
16 July 2013 | ![]() |
IRC office hours for bug management | Andre Klapper |
Personnel
Work with us
Are you looking to work for Wikimedia? We have a lot of hiring coming up, and we really love talking to active community members about these roles.
Announcements
- Sean Pringle joined the Technical Operations team as our Storage and Database Engineer (announcement).
- Brian Wolff joined the Wikimedia Platform Engineering group as Software developer for the Summer, working on multimedia contribution and review (announcement).
- Ken Snider joined the Technical Operations team as an international contractor, poised to fill the Director of Technical Operations position (announcement).
- Toby Negrin joined the Engineering department as Director of Analytics (announcement).
Technical Operations
Site infrastructure
- As part of our capacity planning work, Mark Bergsma upgraded most of our Varnish infrastructure (in EQIAD & ESAMS) with newer and faster servers. He will be adding new mobile Varnish servers in ESAMS next, this coming month. Rob Halsell and Daniel Zahn are pushing ahead with the migration of the other applications from Tampa to EQIAD. New Parsoid application and Varnish servers were also deployed in anticipation of the coming VisualEditor deployment. Meantime, Alexandros Kosiaris is starting the backup project work; read more about the project and the technology.
- Mark also put in the finishing touches to deploy all the new network infrastructure at ESAMS. With help from Mark and Leslie Carr, we finally got approval from ARIN for some new IPv4 addresses, needed for our new ULSFO buildup.
- Many people are refactoring Puppet code with the ultimate goal of having everything organized into Puppet modules. Andrew Bogott, Antoine Musso and Alexandros are setting up an automated testing infrastructure to support these efforts.
- Our GSoC student, Petr Onderka, is set up in Gerrit and committed his first contributions to the Incremental Dumps project; you can follow his code, read his progress reports and check the current discussion on the mailing list. Additionally, we hold IRC meetings on weekdays at about 4:15 pm (UTC) in #wikimedia-tech; lurkers and contributors are welcome.
- Wikimedia Labs saw a lot of improvements in June, including the deployment of AJAX improvements for OpenStackManager to wikitech (added actions: console output; improvements: reboot), and a new interface for displaying quotas for projects in OpenStackManager. We ensured that all instances were properly running Puppet and Salt; Many instances were running
puppetmaster::self
and needed to have local puppet repo merges or rebases. We upgraded Salt everywhere and re-issued keys to fix a vulnerability in Salt. The team also worked on stabilizing the NFS server. We've encountered a kernel bug with NFS; we have changed the scheduler from cfq to deadline and have decreased the read and write sizes of clients to 8 kilobytes. Progress has been made towards making the Labs database replicas available to the Labs at large (as opposed to only the Tool Labs project). Last, much work has been done towards user request fulfillment in Tool Labs, including work towards WSGI support.
Features Engineering
Editor retention: Editing tools
Despite being very short-staffed this month (only two full-time developers), the absence of performance issues left us enough time for a lot more polishing before the VisualEditor release on July 1. As a result, the release went very well with clean diffs on almost all pages.
While more work is left to do, it is now clear that we have fundamentally achieved our goal of a clean translation between WikiText and HTML + RDFa. This does not only enable visual HTML editing, but also makes Wikipedia's content easily accessible in a standardized format. It also opens up new opportunities for MediaWiki's core architecture, which we'll pursue this fiscal year.Editor engagement features
Editor engagement experiments
First, the new Campaigns extension was added to all wikis. This analytics tool helps identify internal or external sources of new registrations, by adding a "campaign" name to the signup page URL. This month, E3 began running campaigns to learn about how many anonymous editors sign up on the top 10 Wikipedias, as well as how many sign up via the invitation to "Join Wikipedia" on the login page (see the list of active campaigns and analysis). Another piece of analytics infrastructure by the team is the new CoreEvents extension, which houses logging of MediaWiki core activity, like preference updates and page saves across all projects.
For the Getting Started project, the team conducted usability testing (see results and documentation) of new designs. E3 also refactored and refined the guided tours extension in June, including adding usability enhancements like new interface animations, support for community tours, and bug fixing. The team also planned and began work on an experiment to deliver guided tours to all first-time editors.
The team also assisted with A/B testing and research for VisualEditor before its July 1 launch date, assisting with experimental design, EventLogging instrumentation, and other work. After the VisualEditor launch, E3 started a week-long micro-survey of newly-registered users on English Wikipedia, to give us a first systematic look at the gender diversity of those creating accounts.Support
Mobile
Platform Engineering
MediaWiki Core
Security auditing and response
Quality assurance
Quality Assurance/Browser testing
Analytics
Analytics/Infrastructure
Analytics/Visualization, Reporting & Applications
Engineering community team
- Improving support for book structures
- Mobilize Wikidata
- jQuery.IME extensions for Firefox and Chrome
- Browser Test Automation for Visual Editor
- Incremental dumps
- Refactoring of Extension ProofreadPage
- Language Coverage Matrix Dashboard
- Section Handling in Semantic Forms
- VisualEditor Mathematical Editor Plugin
- ZIM Incremental Updates for Kiwix
- VisualEditor RTL support
- UploadWizard: book upload customization
- Android app for MediaWiki translation
- Pronunciation recording tool
- MediaWiki Moodle extension
- VisualEditor plugin for source code
- Prototyping inline comments
Volunteer coordination and outreach
Kiwix
The Kiwix project is funded and executed by Wikimedia CH.
- Development of a new MediaWiki HTML dumper in nodeJS has started. This tool exports Wikipedia articles in static files based on the Parsoid output. This solution looks really promising, and new JavaScript developers are welcome.
Wikidata
The Wikidata project is funded and executed by Wikimedia Deutschland.
- June in Wikidata was all about the sister projects. The development team published proposals for how Wikidata can support Commons and Wiktionary. Additionally, they worked on the ability of Wikidata to store language links to Wikivoyage in addition to Wikipedia; as a result, Wikivoyage will soon also be able to manage their language links via Wikidata. Another important step was the deployment of the geocoordinate datatype. This makes it possible, for example, to indicate the location of a city. Geocoordinates that are already in Wikidata can be seen on this map (huge version, updated daily).
- In a blog entry, Denny Vrandečić explained his understanding of the relation of Wikidata and the truth.
- In other news, further development of Wikidata has been supported through a large donation by the search engine company Yandex.
Future
- The engineering management team continues to update the Deployments page weekly, providing up-to-date information on the upcoming deployments to Wikimedia sites, as well as the engineering roadmap, listing ongoing and future Wikimedia engineering efforts. Annual goals for the 2013–2014 fiscal year are currently being drafted.