![]() |
Wikimedia engineering report, January 2014 | February 2014![]() |
Engineering metrics in January:
- 159 unique committers contributed patchsets of code to MediaWiki.
- The total number of unresolved commits went from around 1386 to about 1320.
- About 22 shell requests were processed.
Major news in January include:
- the transition of our search engines on Wikimedia sites to ElasticSearch;
- a presentation of how the Tech newsletter works, including a historical perspective;
- an invitation to comment on the 2-year vision of the multimedia engineering team;
- a request for comments on whether the MP4 video format should be supported on Wikimedia sites.
Note: We're also providing a shorter, simpler and translatable version of this report that does not assume specialized technical knowledge.
Upcoming events
There are many opportunities for you to get involved and contribute to MediaWiki and technical activities to improve Wikimedia sites, both for coders and contributors with other talents.
For a more complete and up-to-date list, check out the Project:Calendar.
Date | Type | Event | Contact |
---|---|---|---|
1 February 2014–2 February 2014 | ![]() |
FOSDEM - Brussels, Belgium | QuimGil |
13 February 2014–14 February 2014 | ![]() |
NYC Enterprise MediaWiki Hackathon - New York, NY, USA | Yaron Koren |
Personnel
Work with us
Are you looking to work for Wikimedia? We have a lot of hiring coming up, and we really love talking to active community members about these roles.
- VP of Engineering
- Software Engineer - Growth
- Software Engineer - VisualEditor (Features)
- Software Engineer - Language Engineering
- Software Engineer
- Software Engineer - Mobile (Android Apps)
- QA Automation Engineer
- Test Infrastructure Engineer
- Analytics - Product Manager
- Director of Community Engagement (Product)
- Sr. Operations Engineer
- Operations Security Engineer
- User Experience Research Lead
Announcements
- Shahyar Ghobadpour joined the Wikimedia Core features team as Software Engineer (announcement).
- David Chan joined the Language Engineering team as Software Engineer (announcement).
- Gilles Dubuc joined the Multimedia team as Senior Software Engineer and Tech Lead (announcement).
- Charles Salvia joined the Analytics team (announcement).
- Sam Smith joined the Features Engineering team as Software Engineer, working with the Growth team (announcement).
- Preteek Saxena joined the UX Design team as UX Prototyping contractor (announcement).
Technical Operations
- The Wikimedia Operations team is in the final stages of the selection process. A short list of 4 bids has been created and final negotiations are underway. The winner of the bid will be selected in February based on the technical criteria listed in the RFP, and pricing.
Labs metrics in January:
- Number of projects: 131
- Number of instances: 441
- Amount of RAM in use (in MBs): 1,734,144
- Amount of allocated storage (in GBs): 23,505
- Number of virtual CPUs in use: 867
- Number of users: 2,595
- The Labs Migration team, consisting of Andrew Bogott and Marc-Andre Pelletier, have made good progress with testing the newest version of Openstack (called Havana) and with Neutron, an OpenStack project to provide "networking as a service". The plan is to upgrade the Openstack software when we migrate the Labs infrastructure out of the Tampa data center.
Features Engineering
Editor retention: Editing tools
Part of the team has been mentoring two Outreach Program for Women (OPW) interns. Others are mentoring a group of students in a Facebook Open Academy project to build a Cassandra storage back-end for the Parsoid round-trip test server.
We also participated in the architecture summit, where our RFCs about embracing a service architecture, PHP bindings for services, a general-purpose storage service based on our Rashomon revision store, and a public content API based on this were well received.
Following up on this, we started Debian packaging for Parsoid, which will soon make the installation of Parsoid as easy asapt-get install parsoid
.Core Features
Growth
Support
Mobile
Language Engineering
Language engineering communications and outreach
Platform Engineering
MediaWiki Core
Site performance and architecture
ProfilerMwprof
profiler class for MediaWiki, and draft performance guidelines.Security auditing and response
Quality assurance
Quality Assurance/Browser testing
Multimedia
Engineering Community Team
Project management tools/Review

Wikimedia's first participation in the Google Code-In program ended up with great success: 273 tasks completed by 46 students with the help of about 30 mentors. Theo Patt and Mateusz Maćkowski were selected winners for Wikimedia, and we sent a special mention to Mayank Madan.
Round 7 of the FOSS Outreach Program for Women started and all projects are on track so far:
- Compacting interlanguage links
- MediaWiki Homepage Redesign
- Complete the MediaWiki API development course on Codecademy
- Clean up Parsoid round-trip testing UI
- Clean up tracing/debugging/logging inside Parsoid
- UploadWizard: OSM Embedding
Volunteer coordination and outreach
Analytics
Analytics/Logging infrastructure

We completed research on article creation trends on the largest Wikipedias and found substantial differences between different language Wikipedias; specifically, where anonymous editors are allowed to create articles, their success rate (% of articles kept) is substantially higher than that of newly registered editors. We also found that articles that started as Articles for Creation (AfC) and userspace drafts have a near 100% success rate, but the transition that English Wikipedia made toward directing newcomers to start AfC drafts appears to have substantially reduced the amount of successful articles created by newcomers, presumably due to the large review backlog.
We published an update on Visual Editor usage on Wikipedia projects where the editor is enabled by default.
We continued work on metrics standardization for the editor engagement vital signs project and published supportive analysis on definitions and parameter exploration for two proposed standardized user classes: new editor and productive new editor.
We worked with the Analytics Development and Legal teams to articulate use cases and the retention and anonymization strategy for data subject to the retention guidelines, in particular with respect to user agents.
We welcomed Sahar Massachi as a research contractor supporting the team with data analysis for fundraising tests and iterated on new modeling strategies for estimating test success (such as the number of dollars per banner impression). Before he joined us, Sahar worked with the fundraising team, where most recently he focused on writing tools to help the team easily and quickly understand the results of each test.Offline
- The Kiwix project is funded and executed by Wikimedia CH.
- Much time this month was spent planning for 2014. We mainly worked on mwoffliner and almost managed to create a full English Wikipedia ZIM file with thumbnails. The upgrade of our main storage platform allowed us to start our automatic ZIM file generation system.
Wikidata
The Wikidata project is funded and executed by Wikimedia Deutschland.
- In January, the team worked mainly on performance improvements around Wikidata. The Quantities datatype was deployed so it is now possible to enter data like the number of inhabitants of a country. Wikisource can now manage its language links via Wikidata as well just like Wikipedia, Wikivoyage and Commons could already. Two new front-end developers, Adrian and Thiemo joined the team to help improve Wikidata's user interface. Last but not least, the team released their plan for the development of Wikidata in 2014 and beyond.
Future
- The engineering management team continues to update the Deployments page weekly, providing up-to-date information on the upcoming deployments to Wikimedia sites, as well as the annual goals, listing ongoing and future Wikimedia engineering efforts.