[21:27:32] Nettrom: o/
[21:27:49] leila: hey, what's up?
[21:28:49] Nettrom: during the research office hours today, a participant asked if we expose the number of newcomers who have done their first edits (activated) somewhere easily accessible (per month, I assume).
[21:29:11] Nettrom: do you know of any place or easy way for them to compute the number?
[21:29:39] Lydia_WMDE: o/
[21:31:30] leila: hmm, I'm unsure if we're reporting new editors or new active editors in the monthly board metrics, let me check
[21:31:51] Nettrom: thanks! :)
[21:32:22] (I looked at your calendar to see if you're available and it told me that you're in another office hour. hence the late follow-up.)
[21:34:16] Nettrom: fwiw, the question stems from another question which is: whether the number of newly activated editors has significantly changed during covid-19 (I assume they want to dig in it more than just yes/no).
[21:34:53] leila: you're good, I wouldn't have responded if I didn't have time :)
[21:35:06] leila: the monthly movement metrics only reports new active editors: https://upload.wikimedia.org/wikipedia/commons/d/d5/September_2020_Wikimedia_movement_metrics_%281%29.pdf
[21:35:23] September 2020 is the most recent, October numbers will arrive soon
[21:36:18] it's movement-wide, but there's the COVID bump in April/May/June
[21:37:49] Nettrom: thanks. I'll share that link as well.
[21:38:38] Nettrom: are the monthly movement metrics reported somewhere they can access publicly? (and see the logic/definitions used to compute it)?
[21:39:31] leila: I don't think they're readily available in a combined form publicly, but instead have to be dug out of the PDFs
[21:39:51] the code that computes them is on GitHub, let me dig that up
[21:40:23] Nettrom: I see. that's already helpful (they will know they have to compute it.) the code will be very helpful. thanks! (and I'll give you all the credit for the response. thanks for helping me find an answer for them.)
[21:42:17] leila: the code is here: https://github.com/wikimedia-research/Editing-movement-metrics It does run on internal data sources (e.g. mediawiki_history), but it documents how the metrics are measured
[21:43:55] Nettrom: great. thanks. And I have one final question for you: is there a standard way that you (the Growth team) measure retention today?
[21:44:41] leila: yeah, we require the user to make an edit in the first 24 hours ("activation") and then edit again in the following two weeks
[21:44:45] Nettrom: miriam shared https://meta.wikimedia.org/wiki/Research:Editor_retention and I wonder if that's what you all use or there are more updates/improvements to it. Also, if you have any code for that, that'll be helpful, too.
[21:46:13] leila: the Growth team's metrics are geared towards short-term activity, whereas "new editor retention" in the movement metrics is second-month retention
[21:53:18] leila: I pushed my data gathering notebook from the Growth team's analysis of the Homepage last year to GitHub
[21:53:24] https://github.com/nettrom/Growth-homepage-2019/blob/master/analysis_datasets.ipynb
[21:53:53] while it's written to query MediaWiki history, querying the replicas for a specific wiki on Toolforge would be similar
[22:00:33] Nettrom: MediaWiki history itself is public as a monthly dump, right?
[22:00:49] leila: ah yes, it is now!
[22:01:00] yeah. so your code is even more helpful. thanks!
[22:01:36] BTW, the dumps are here, in case you need that shortcut too :) https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Edits/Mediawiki_history_dumps
[22:01:48] *described here
[22:02:06] awesome! happy to help out, let me know if there are follow up questions about this :)
[22:06:33] Nettrom: will do. and FYI, it's topic 2 in https://etherpad.wikimedia.org/p/Research-Analytics-Office-hours (currently line 38+). Feel free to edit directly if you want to improve it.
[22:06:41] thanks, Nettrom.
[22:08:08] topic 1*
[22:09:51] leila: thanks!
I'll make a note to take a closer look at it later. BTW, I browsed through and noticed Jan Ainali's question about media
[22:10:08] the Structured Data team is also interested in that, and I know Miriam's also working on it
[22:10:10] Nettrom: yes. thoughts on that? ;)
[22:10:12] let me dig up the phab task I have
[22:10:53] Nettrom: excellent. thanks. (and miriam was in the call and shared context and some info. she has a todo lower in that page so feel free to share the link under her todo for her to see it and include it.)
[22:11:06] leila: cool, I'll do that!
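
The activation and retention definitions given at 21:44 (an edit in the first 24 hours after registration, then another edit in the following two weeks) could be sketched roughly as below. This is only an illustrative sketch and not the Growth team's code from the notebooks linked above. The function names, the sample users, and the exact window boundaries (retention counted over the 14 days that start when the 24-hour activation window ends) are all assumptions made for the example.

```python
from collections import Counter
from datetime import datetime, timedelta

# Assumed window boundaries; the log only says "first 24 hours" and
# "the following two weeks", so the exact cut-offs here are a guess.
ACTIVATION_WINDOW = timedelta(hours=24)
RETENTION_WINDOW = timedelta(days=14)


def is_activated(registered, edit_times):
    """True if the user made at least one edit within 24 hours of registering."""
    return any(registered <= t < registered + ACTIVATION_WINDOW for t in edit_times)


def is_retained(registered, edit_times):
    """True if the user activated and edited again in the following two weeks."""
    if not is_activated(registered, edit_times):
        return False
    start = registered + ACTIVATION_WINDOW
    end = start + RETENTION_WINDOW
    return any(start <= t < end for t in edit_times)


def activated_per_month(users):
    """Count activated users, keyed by registration month (YYYY-MM)."""
    counts = Counter()
    for registered, edit_times in users.values():
        if is_activated(registered, edit_times):
            counts[registered.strftime("%Y-%m")] += 1
    return dict(counts)


# Hypothetical sample data: user name -> (registration time, edit timestamps).
users = {
    "a": (datetime(2020, 4, 1, 12), [datetime(2020, 4, 1, 13), datetime(2020, 4, 10)]),
    "b": (datetime(2020, 4, 5), [datetime(2020, 4, 5, 1)]),
    "c": (datetime(2020, 5, 2), [datetime(2020, 5, 9)]),
}

for name, (registered, edit_times) in users.items():
    print(name, is_activated(registered, edit_times), is_retained(registered, edit_times))
print(activated_per_month(users))
```

With real data, the registration and edit timestamps would come from the MediaWiki history dumps or the Toolforge replicas mentioned in the log, and the per-month counts could then be compared across the COVID period.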