diff --git a/talks-public/2022-09-28-ese-research/2022-09-28-ese-research.org b/talks-public/2022-09-28-ese-research/2022-09-28-ese-research.org index 1e45f9e..c5ab101 100644 --- a/talks-public/2022-09-28-ese-research/2022-09-28-ese-research.org +++ b/talks-public/2022-09-28-ese-research/2022-09-28-ese-research.org @@ -1,30 +1,67 @@ #+COLUMNS: %40ITEM %10BEAMER_env(Env) %9BEAMER_envargs(Env Args) %10BEAMER_act(Act) %4BEAMER_col(Col) %10BEAMER_extra(Extra) %8BEAMER_opt(Opt) #+TITLE: Empirical Software Engineering Research with Software Heritage #+BEAMER_HEADER: \date[2022-09-28]{28 September 2022} #+BEAMER_HEADER: \title[Empirical Software Eng. Research with Software Heritage]{Empirical Software Engineering Research with Software Heritage} #+AUTHOR: Stefano Zacchiroli #+DATE: 28 September 2022 #+EMAIL: stefano.zacchiroli@telecom-paris.fr #+INCLUDE: "../../common/modules/prelude-toc.org" :minlevel 1 #+INCLUDE: "../../common/modules/169.org" #+BEAMER_HEADER: \institute[Télécom Paris]{Télécom Paris, Polytechnic Institute of Paris\\ {\tt stefano.zacchiroli@telecom-paris.fr}} #+BEAMER_HEADER: \author{Stefano Zacchiroli} * Datasets ** Graph dataset #+INCLUDE: "../../common/modules/dataset.org::#graphdataset" :only-contents t ** Graph dataset --- example #+INCLUDE: "../../common/modules/dataset.org::#graphquery1" :only-contents t ** License dataset #+INCLUDE: "../../common/modules/dataset.org::#licensedataset" :only-contents t * Accessing source code artifacts ** The Software Heritage Filesystem (SwhFS) #+INCLUDE: "../../common/modules/swh-fuse.org::#oneslide" :only-contents t ** The Software Heritage Filesystem (SwhFS) --- example #+INCLUDE: "../../common/modules/swh-fuse.org::#examplemini" :only-contents t ** Graph compression #+INCLUDE: "../../common/modules/graph-compression.org::#oneslide" :only-contents t * Software provenance and evolution +** Software provenance and evolution + TODO * Software forks +** Software forks + TODO * Diversity, equity, and inclusion +** Diversity, equity, and inclusion +*** Idea + Archived commit metadata contains public information that can be mined to + study long-term trends of diversity, equity, and inclusion (DEI) traits of + the global population of public code contributors. + +*** Key findings on the gender gap + - Male authors contributed 92% of public code commits up to 2019. + - The ratio of female authors (and their contributions) has grown stably + for 15 years reaching for the first time 10% of yearly contributions + in 2019. + - The COVID-19 pandemic has reversed the trend. + +** Diversity, equity, and inclusion (cont.) + +*** Key findings on the geographic gap + - The early decades of public code were dominated by contributions from + North America, followed by a period of alternating dominance between + North America and Europe. + - Since then geographic diversity has increased constantly, with raising + importance of contributions from Central and South America. + - The trend of increased female contributions is almost worlwide, with the + notable exception of specific regions of Asia were it is either slower or + flat. + +*** References + #+BEAMER: \footnotesize + - Zacchiroli. /Gender differences in public code contributions: a 50-year + perspective/. IEEE Software, 2021 + - Rossi and Zacchiroli. /Worldwide gender differences in public code + contributions/. ICSE SEIS, 2022 + - Rossi and Zacchiroli. /Geographic diversity in public code + contributions/. MSR 2022