Why Companies “Democratise” Artificial Intelligence: The Case of Open Source Software Donations

Cailean Osborne
Oxford Internet Institute
University of Oxford
Oxford, UK
[email protected]
Abstract

Companies claim to “democratise” artificial intelligence (AI) when they donate AI open source software (OSS) to non-profit foundations or release AI models, among others, but what does this term mean and why do they do it? As the impact of AI on society and the economy grows, understanding the commercial incentives behind AI democratisation efforts is crucial for ensuring these efforts serve broader interests beyond commercial agendas. Towards this end, this study employs a mixed-methods approach to investigate commercial incentives for 43 AI OSS donations to the Linux Foundation. It makes contributions to both research and practice. It contributes a taxonomy of both individual and organisational social, economic, and technological incentives for AI democratisation. In particular, it highlights the role of democratising the governance and control rights of an OSS project (i.e., from one company to open governance) as a structural enabler for downstream goals, such as attracting external contributors, reducing development costs, and influencing industry standards, among others. Furthermore, OSS donations are often championed by individual developers within companies, highlighting the importance of the bottom-up incentives for AI democratisation. The taxonomy provides a framework and toolkit for discerning incentives for other AI democratisation efforts, such as the release of AI models. The paper concludes with a discussion of future research directions.

Keywords Open source software, artificial intelligence, PyTorch, donations, incentives, AI democratisation

1 Introduction

Companies are increasingly “democratising” artificial intelligence (AI). However, “AI democratisation” remains an ambiguous term, encompassing a variety of goals and methods (Seger et al., 2023b, ), including the release of AI open source software (OSS) (Langenkamp and Yue,, 2022; Srnicek,, 2022), its donation to non-profit foundations (Yue and Nagle,, 2024), and the release of AI models (henceforth: open models) (Osborne et al.,, 2024; Widder et al.,, 2023), among others. While press releases celebrate a myriad of benefits that AI democratisation promises for research and innovation, the commercial incentives driving such efforts are often obscured from public view. Given the ever-increasing impact of AI on society and the economy, understanding the commercial incentives for AI democratisation efforts is crucial so that we can ensure these efforts serve broader societal interests beyond commercial agendas.

Towards this end, this study presents an exploratory investigation of why companies democratise AI with a focus on OSS donations as one method, among others, of AI democratisation. Through a mixed-methods approach, combining the analysis of pre-donation technical pitches, post-donation blog posts, a questionnaire, and semi-structured interviews, it investigates commercial incentives for 43 AI OSS donations to the Linux Foundation (LF), making contributions to both research and practice. It makes contributions to both research and practice. It contributes a taxonomy of both individual and organisational social, economic, and technological incentives for AI democratisation. In particular, it highlights the role of democratising the governance and control rights of an OSS project (i.e., from one company to vendor-neutral, open governance) as a structural enabler for downstream goals, such as attracting external contributors, reducing development costs, and influencing industry standards, among others. Furthermore, it sheds light on the role of individual developers within companies, who champion and coordinate OSS donations, thus highlighting the relevance of the bottom-up incentives for AI democratisation. The taxonomy provides a framework and toolkit for discerning incentives for other AI democratisation efforts, such as the release of AI models (White et al.,, 2024).

The paper is structured as follows. First, it reviews prior work on the political economy of open source AI and OSS (Section 2). Second, it presents the research aims and the study design (Section 3). Third, it reports the key findings (Section 4). Fourth, it discusses the implications of the findings for research and practice as well as the threats to validity (Section 5). Finally, the paper concludes with a summary of the key contributions (Section 6).

2 Related Work

2.1 “Democratising AI”: Narratives and Practices

In the context of concerns about industry concentration and influence on AI research and development (R&D) (Ahmed et al.,, 2023; Kak and Myers West,, 2023; Whittaker,, 2021; Varoquaux et al.,, 2024), it has become en vogue for companies to claim to “democratise AI”—an altruism-laden term that is notoriously ambiguous. Prior work finds that it used as a catch-all term to encompass a variety of goals and practices (Seger et al., 2023b, ), including the following:

  • Democratising AI use: Lowering entry barriers for the use of AI technologies, including but not limited to commercial products like OpenAI’s ChatGPT or GitHub’s Copilot, access to AI models through APIs or publicly available model weights, and the release of AI OSS like PyTorch and TensorFlow.

  • Democratising AI development: Lowering the entry barriers for the development of AI technologies, including but not limited to the release of AI models and AI OSS.

  • Democratising AI profits: Redistributing the economic value accrued to companies from their use of AI technologies to the respective users and impacted populations.

  • Democratising AI governance: Distributing the decision-making power in the development or use of AI technologies to a wider community of stakeholders and impacted populations.

In most cases, AI democratisation is used to refer to the lowering of barriers for the use or the development of AI technologies, leading Seger et al., 2023b to conclude that, “‘AI democratisation’ is a (mostly) unfortunate term”. Open source technologies and collaboration methods have been integral to AI democratisation efforts, offering the means of enabling both access to and participation in the development of AI. Commercial releases of AI OSS (Langenkamp and Yue,, 2022; Srnicek,, 2022) and open models (Osborne et al.,, 2024; Widder et al.,, 2023) have contributed to the rapid growth of the open source AI ecosystem, which now comprises over 300 AI OSS libraries (Haddad,, 2022), hundreds of thousands of open models (Osborne et al.,, 2024), and over a million AI OSS repositories (GitHub,, 2023).

The prevalence of AI democratisation efforts begs the questions of why companies release their AI software and models, and what are the impacts thereof on the norms, practices, and potential trajectories of AI developer communities. Prior work hints at a number of incentives. Scholars contend that industry giants promote open source as an alternative to their concentrated power in the AI industry, whilst using it as a means to shape industry standards, benefit from user innovation, and ultimately extend their influence the norms and tools used by researchers and developers around the world (Widder et al.,, 2023; Srnicek,, 2022). Nick Srnicek argues that “the seemingly non-capitalist practice of releasing their AI software for free in fact obscures a significant capitalist battle between the major companies” (Srnicek,, 2022). This was evident in a leaked Google memo, which claimed that “open source solutions will out-compete companies like Google or OpenAI” and for this reason they should “own the ecosystem and let open source work for us” (Patel and Ahmad,, 2023). As discussed below, this leaked memo highlights the ethical tensions that emerge from company’s attempts to exploit the collective efforts of OSS developer communities (Birkinbine,, 2020; Li et al.,, 2022).

Other companies, such as Meta, have been outspoken about the drivers of their open source AI strategy: by releasing AI software like PyTorch and large language models (LLM) like their LLaMA models, Meta seeks to increase adoption of its technology, improve their performance and safety through distributed feedback and innovation, and ultimately benefit from ecosystem effects. For example, upon releasing LLaMA 2, Nick Clegg, Meta’s President of Global Affairs, explained that “Openness isn’t altruism—Meta believes it’s in its interest. It leads to better products, faster innovation, and a flourishing market, which benefits us as it does many others” (Clegg,, 2023). He argued that releasing LLaMA 2 would make it “safer and better” because it will benefit from the “wisdom of the crowds.” Clegg added that, “A mistaken assumption is that releasing source code or model weights makes systems more vulnerable. On the contrary, external developers and researchers can identify problems that would take teams holed up inside company silos much longer.” Meanwhile Mark Zuckerberg, Meta’s CEO, has explained publicly that Meta seeks to build an ecosystem around their AI software and models as a source of strategic advantage. For example, Zuckerberg explained to shareholders that the widespread use of PyTorch has “been very valuable for us” because it has facilitated their use of external AI research and innovations that use PyTorch (Meta,, 2023). Furthermore, upon the release of LLaMA 3, he explained that they are not doing open source “because we are, like, altruistic… I just want everyone to be using it because the more people who are using it, the more the flywheel will spin for making LLaMA better” (South Park Commons,, 2024). These statements provide some answers to why technology giants like Meta democratise AI, but it remains to be studied systematically why various kinds of companies, including startups, engage in AI democratisation efforts.

There are also concerns about “open-washing” by startups and industry giants alike, who have been promoting open models released under restrictive licenses and with limited transparency as “open source” (White et al.,, 2024; Widder et al.,, 2023; Liesenfeld et al.,, 2023). Liesenfeld and Dingemanse (2024) argue that companies engage in open-washing to reap the benefits of open source (e.g., reputation rewards and adoption), whilst not actually complying with open source standards or norms (e.g., via restrictive licenses). For example, Meta released LLaMA 2 with much fanfare, claiming that the “open source” model would benefit research and innovation, its distribution under a novel license with restrictive commercial terms (i.e., any company with greater than 700 million monthly active users in the preceding month must request a license that Meta may grant in its sole discretion) received backlash from the open source community (White et al.,, 2024; Widder et al.,, 2023; Maffulli,, 2023). For example, Stefano Maffulli, the Executive Director of the Open Source Initiative, commented, “Unfortunately, the tech giant has created the misunderstanding that LLaMA 2 is ‘open source’ – it is not. Meta is confusing ’open source’ with ‘resources available to some users under some conditions,’ [which are] two very different things” (Maffulli,, 2023).

Whether open source AI will deliver its promised equalising effects or lead to further industry concentration remains to be seen. However, it is important to view these open source AI developments in the context of developments and concentrations in the wider AI supply chain or “AI stack”, including the hardware accelerators (i.e., chips), data, talent, and compute infrastructure required to develop and deploy AI systems in practice (Widder et al.,, 2023; Srnicek,, 2022; Lehdonvirta,, 2023; Lehdonvirta et al.,, 2023; Don-Yehiya et al.,, 2024). One may argue that no matter how much AI software or how many AI models companies release publicly, regardless of their license choice, such AI democratisation efforts will do little to fundamentally reconfigure the distribution of power and resources in the wider AI industry.

This prior work provides insights into the open source AI strategies of industry giants and concerns related to open-washing. However, we still have significant gaps in our understanding of why and how different types of companies, beyond industry giants, employ open source as a means to “democratise” AI. To address this gap, in the next section, I draw on the extensive literature on the political economy of OSS, which provides a comprehensive theoretical foundation for understanding commercial incentives for AI democratisation efforts that are facilitated by open source technology.

2.2 Commercial Incentives for OSS Development

Companies have participated in the development of OSS in a myriad of ways since the late 1990s (Broca,, 2021; Li et al.,, 2024), including by deploying developers to contribute to projects as part of their job responsibilities or corporate social responsibility initiatives (Dahlander and Magnusson,, 2005; Dahlander and Wallin,, 2006; Lee and Herstatt,, 2015; Zhang et al.,, 2021), funding projects (O’Brien,, 2019; Osborne,, 2024), or joining project steering committees (Butler et al.,, 2018; Wagstrom,, 2009), among others. These are popular strategies through which companies seek to influence projects that develop maintain OSS that they use Dahlander and Wallin, (2006); Osborne, (2024). It is also common for companies to spin-out proprietary software as company-hosted OSS, where the host company controls the intellectual property of the project (e.g., by requiring contributors to sign a contributor license agreement) and employs the maintainers of the project (Zhou et al.,, 2016; Yue and Nagle,, 2024). This is a proven commercial strategy to increase adoption of their software, benefit from external contributions, win market share, or reduce a competitor’s market share (West and O’Mahony,, 2005). In some cases, a handful of companies share control of a project; for example, in 2017, Facebook and Microsoft jointly released the Open Neural Network Exchange (ONNX) to enable interoperability between various deep learning frameworks like PyTorch and TensorFlow (Candela,, 2017).

An extensive literature discusses the diverse incentives for commercial adoption and development of OSS at both the individual level (see Table 1) and the organisational level (see Table 2). Bonaccorsi and Rossi’s (2006) taxonomy of social, economic, and technological incentives at the individual and organisational levels provides an enduring framework for categorising these diverse incentives. In addition, they find that divergent incentives between individuals and organisations. While individuals are mostly driven by social and technological incentives, such as their personal interest (Benkler,, 2006; Raymond, 2001a, ; von Krogh et al.,, 2012), values (Kelty,, 2008; Lakhani and von Hippel,, 2003; Shah,, 2006), or needs (Franke and Hippel,, 2003; Hars and Ou,, 2002; Roberts et al.,, 2006), companies are chiefly motivated by economic and technological factors, such as influencing industry standards (Lerner,, 2002; Lindman et al.,, 2009), reducing development costs (Birkinbine,, 2020; Chesbrough,, 2023), and recruiting external developers (Agerfalk and Fitzgerald,, 2008; Fink,, 2003). However, the incentives of individuals vary based on factors such as whether they are volunteers or paid (Lakhani and Wolf,, 2003; Dahlander and Wallin,, 2006), the governance structure of the OSS project (Shah,, 2006), and their geography and cultural norms (Subramanyam and Xia,, 2008; Takhteyev,, 2012), among others. While individual developers, including volunteers, do not share the primary incentives of for-profit companies, they tend to accept commercial participation on condition that they comply with community norms (Bonaccorsi and Rossi,, 2006). Furthermore, commercial participation in OSS projects can also attract volunteers, who see their presence as a signal of the complexity of the project (O’Mahony and Bechky,, 2008; Santos et al.,, 2013).

Table 1: Incentives for Individuals to Participate in OSS Development
Category Sub-category References
Social Fun & geek culture Torvalds, (2001); Raymond, 2001a ; Gerosa et al., (2021); Lakhani and Wolf, (2003); Lakhani and von Hippel, (2003); Benkler, (2002, 2006); Hertel et al., (2003); Luthiger and Jungwirth, (2007); Roberts et al., (2006); Shah, (2006); von Hippel, (2003); Xu et al., (2009); Hemetsberger, (2002)
Reputation & peer recognition Raymond, 2001a ; Gerosa et al., (2021); Bezroukov, (1999); Ghosh et al., (2002); Lakhani and Wolf, (2003); Lakhani and von Hippel, (2003); Lerner, (2002); Fershtman and Gandal, (2004); Ghosh, (2007, 1998); Hars and Ou, (2002); Hertel et al., (2003); Lattemann and Stieglitz, (2005); Lerner and Tirole, (2002); Okoli and Oh, (2007); Oreg and Nov, (2008); Roberts et al., (2006); Spaeth et al., (2008); Stewart and Gosain, (2006); Xu et al., (2009); Yu et al., (2007); Hemetsberger, (2002); Loebbecke and Angehrn, (2003); Krishnamurthy, (2006); Howison and Herbsleb, (2013)
Altruism & reciprocity Raymond, 2001a ; Gerosa et al., (2021); Raymond, 2001b ; Bergquist and Ljungberg, (2001); Bitzer et al., (2007); Crowston and Scozzi, (2002); Hars and Ou, (2002); Osterloh et al., (2003); David and Shapiro, (2008); David et al., (2003); Ghosh and Schmidt, (2006); Haruvy et al., (2003); Lakhani and Wolf, (2003); Oreg and Nov, (2008); Schofield and Cooper, (2006); Shah, (2006); Stewart and Gosain, (2006); Wu et al., (2007); Yu et al., (2007); Nagle et al., (2020); Ding et al., (2023)
Kinship with OSS community Pfaff and David, (1998); Gerosa et al., (2021); Raymond, 2001b ; Hars and Ou, (2002); David and Shapiro, (2008); Hertel et al., (2003); Lakhani and von Hippel, (2003); Schofield and Cooper, (2006); Zeitlyn, (2003); Bagozzi and Dholakia, (2006); Hemetsberger, (2002)
Ideology & personal values Stallman, (1984); Raymond, 2001a ; Gerosa et al., (2021); Kelty, (2008); Takhteyev, (2012); Schoonmaker, (2018); David and Shapiro, (2008); David et al., (2003); Ghosh and Schmidt, (2006); Hertel et al., (2003); Lakhani and von Hippel, (2003); Shah, (2006); Stewart and Gosain, (2006); Xu et al., (2006); Yu et al., (2007); Akiki et al., (2022)
Economic Payment & sponsorship Lerner, (2002); Feller and Fitzgerald, (2002); Hertel et al., (2003); Hars and Ou, (2002); Ghosh and Schmidt, (2006); Lakhani and Wolf, (2003); Lattemann and Stieglitz, (2005); Luthiger and Jungwirth, (2007); Roberts et al., (2006); Howison and Herbsleb, (2013)
Job-seeking & career benefits Lerner and Tirole, (2001, 2002); Lakhani et al., (2002); Lakhani and Wolf, (2003); Gerosa et al., (2021); Ghosh et al., (2002); Subramanyam and Xia, (2008); Hossain, (2021); Hann et al., (2002); Hars and Ou, (2002); Hertel et al., (2003); Riehle, (2007); Roberts et al., (2006); Shah, (2006); Yu et al., (2007)
Low opportunity costs for use Bonaccorsi and Rossi, (2003); Kollock, (1999); Lakhani and Wolf, (2003)
Technological Learning & skill development Bonaccorsi and Rossi, (2006); Lakhani and Wolf, (2003); Ghosh et al., (2002); David and Shapiro, (2008); David et al., (2003); Hars and Ou, (2002); Gerosa et al., (2021); Roberts et al., (2006); Oreg and Nov, (2008); Shah, (2006); Stewart and Gosain, (2006); Spaeth et al., (2008); von Hippel, (2003); Wu et al., (2007); Ye and Kishida, (2003); Hemetsberger, (2002); Ding et al., (2023)
Use state-of-the-art software Pfaff and David, (1998); Pavlicek, (2000); Green, (2000); Franke and Hippel, (2003); Feller and Fitzgerald, (2002); Ding et al., (2023)
Personal need for software Raymond, 2001a ; von Hippel, (2003, 2001); Hars and Ou, (2002); Osterloh et al., (2003); Bitzer and Schroder, (2006); Ghosh et al., (2002); Hertel et al., (2003); Lattemann and Stieglitz, (2005); Schofield and Cooper, (2006); Shah, (2006); Kuan, (2001); David and Shapiro, (2008); Ghosh and Schmidt, (2006)

N.B. This table was adapted from Bonaccorsi and Rossi, (2006) and updated with more recent publications.

Table 2: Incentives for Companies to Participate in OSS Development
Category Sub-category References
Social Conformity with OSS values Lerner, (2002); Osborne, (2024)
Reciprocity to OSS community Feller and Fitzgerald, (2002); Franck and Jungwirth, (2002); Osterloh et al., (2003); Lerner and Tirole, (2002); Germonprez et al., (2013)
Reputation as OSS patron Osterloh et al., (2003); Lerner and Tirole, (2002); Pitt et al., (2006)
OSS values (strategy to reduce market power of competitors) Feller and Fitzgerald, (2002)
Economic Reduce development costs Agerfalk and Fitzgerald, (2008); Birkinbine, (2020, 2018); Chesbrough, (2023); Feller and Fitzgerald, (2002); Hawkins, (2004); Lindman et al., (2009); Markus et al., (2000); Marsan et al., (2012); Woods and Guliani, (2005); Kendall et al., (2016); Germonprez et al., (2013); Lakhani et al., (2002); Linåker et al., (2016); Loebbecke and Angehrn, (2003); Nguyen Duc et al., (2017); Nguyen-Duc et al., (2019); Teixeira, (2014); Teixeira and Lin, (2014); Teixeira et al., (2015); Li et al., (2022)
Independence from price and license policies of vendors Chesbrough, (2023); Lerner and Tirole, (2002)
Revenue generation from complementary services & products Chesbrough, (2023); Feller and Fitzgerald, (2002); Lindman et al., (2009); West, (2003); Wichmann, (2002)
Recruit software engineers Agerfalk and Fitzgerald, (2008); Fink, (2003); Lerner and Tirole, (2002); Mugrage, (2022); Wichmann, (2002); Marlow and Dabbish, (2013)
Increase market competitiveness Ahlawat et al., (2021); Chesbrough, (2023); Hossain, (2021); Lindman et al., (2009); Loebbecke and Angehrn, (2003); Woods and Guliani, (2005); Kendall et al., (2016)
Technological Crowdsource innovation Agerfalk and Fitzgerald, (2008); Birkinbine, (2020, 2018); Chesbrough, (2023); Feller and Fitzgerald, (2002); Hawkins, (2004); Lindman et al., (2009); Markus et al., (2000); Marsan et al., (2012); Woods and Guliani, (2005); Kendall et al., (2016); Germonprez et al., (2013); Lakhani et al., (2002); Linåker et al., (2016); Teixeira, (2014); Li et al., (2022)
Enhance software security Alrawashdeh et al., (2020); Bonaccorsi and Rossi, (2006); Birkinbine, (2020); Chesbrough, (2023); Fink, (2003); Widder et al., (2023); Hecker, (1999); Hawkins, (2004); Henkel, (2006); von Hippel, (2003); Lerner and Tirole, (2002); Lindman et al., (2009); Krishnamurthy, (2005); Germonprez et al., (2013)
Increase adoption of software Lerner and Tirole, (2002); Wagstrom, (2009); Widder et al., (2023); Srnicek, (2022)
Promote open standards
and technical interoperability
Chesbrough, (2023); Fink, (2003); Lerner and Tirole, (2002); Lindman et al., (2009); Wichmann, (2002); West, (2003)
Influence the technical roadmap
& direction of OSS projects
Dahlander and Wallin, (2006); Lakhani and Wolf, (2003); Linåker and Runeson, (2020); Lindman et al., (2009); von Krogh et al., (2012); Kendall et al., (2016)

N.B. This table was adapted from Bonaccorsi and Rossi, (2006) and updated with more recent publications.

For companies, economic and technological incentives are the most salient. In particular, the distributed production model of OSS development, involving many more skilled developers beyond those within the organisational boundaries of any single company, is viewed as means to decrease in-house R&D costs (Lindman et al.,, 2009; Agerfalk and Fitzgerald,, 2008). Benjamin Birkinbine argues that the greatest value of OSS for companies stems from the peer production model that expands the labour force that can test and develop the software. Specifically, he contends that the value for companies stems from the processes, not the products, of OSS development (Birkinbine,, 2020). These incentives were underscored in the aforementioned Google memo, which urged the company to “own the ecosystem and let open source work for us” (Patel and Ahmad,, 2023). The extent of volunteer activity in OSS development, from bug-spotting to code contributions (Feller and Fitzgerald,, 2002; Roberts et al.,, 2006; Zhou et al.,, 2016), raises ethical questions about the exploitation of volunteer work (Li et al.,, 2022) and the failure of most companies to adequately reciprocate to support the sustainability of OSS projects (Birkinbine,, 2020; Osborne,, 2024). For example, while Linus’ Law—i.e., that “Given enough eyeballs, all bugs are shallow” (Raymond, 2001a, )—is typically quoted to argue that the OSS development model offers security advantages over proprietary software development, one can extend it to convey the value of distributed bug-spotting and improvements that no single company must pay for by themselves.

While volunteers make useful contributions to OSS projects (Barcomb et al.,, 2020; Capiluppi and Michlmayr,, 2007; Crowston and Scozzi,, 2008; Zhou and Mockus,, 2015), it is often the case that companies specifically seek to collaborate with other companies, including market rivals, as a means to jointly share R&D costs (Zhang et al.,, 2020; Teixeira and Lin,, 2014; Teixeira et al.,, 2016; Nguyen-Duc et al.,, 2019) and shape industry standards (Lerner and Tirole,, 2001, 2002; West and O’Mahony,, 2005). While inter-company collaborations do not necessarily exclude volunteers, it is not uncommon for companies to engage in strategic or contractual collaborations, involving private collaboration methods, which volunteers cannot participate in (Osborne et al, forthcoming). The prevalence of inter-company collaborations has turned many OSS communities “from networks of individuals into networks of companies” (Agerfalk and Fitzgerald,, 2008), resulting in a tangle of cooperation and competition between companies that has become known as “open source co-opetition” (Zhang et al.,, 2020; Teixeira and Lin,, 2014; Teixeira et al.,, 2016; Nguyen-Duc et al.,, 2019). Moreover, commercial participation in OSS development can help companies to improve their market position by undercutting the product of a market rival (Fink,, 2003; West and O’Mahony,, 2005) as well as to enhance their reputation as an OSS patron among developers (Feller and Fitzgerald,, 2002; Bonaccorsi and Rossi,, 2006; Pitt et al.,, 2006), which in turn can help to recruit software developers (Agerfalk and Fitzgerald,, 2008; Lindman et al.,, 2009).

The diversity of incentives of various stakeholders underlines the critical role of governance in OSS projects (O’Mahony and Bechky,, 2008). Non-profit foundations have emerged as key mediators—or “boundary organisations”—whose vendor-neutrality and open community governance have proven to be effective structural enablers of collaboration between “unexpected allies” (O’Mahony and Bechky,, 2008). For instance, the LF is reputed to facilitate “communities of competitors,” where “market rivals…intentionally coordinate activities for mutual benefit in precise, market-focused, non-differentiating engagements” (Germonprez et al.,, 2013). Foundations limit the dominance of any single company in OSS projects, which attracts new contributors to projects (Di Giacomo et al.,, 2020; West and O’Mahony,, 2005; Link and Germonprez,, 2016), especially volunteers who are hesitant about performing free work for a company (Zhang et al.,, 2018, 2021; Zhou et al.,, 2016), and increases their adoption (Zhou et al.,, 2016). However, foundations do not always prevent commercial dominance in OSS projects (Wagstrom,, 2009; Zhang et al.,, 2018, 2022). For instance, around 10% of companies account for 80% of commits to projects in the OpenStack ecosystem (Zhang et al.,, 2021). Moreover, governance changes resulting from donations do not guarantee activity increases. For example, a study of PyTorch’s governance transition from Meta to the LF revealed no net increase in project activity and specifically that contributions from Meta decreased significantly, those from users (e.g., app developers and cloud providers) remained unchanged, but those from complementors (e.g., chip manufacturers) increased (Yue and Nagle,, 2024). While governance changes may address “hold-up” problems for certain companies, particularly for complementors whose value capture proposition depends on interoperability, they do not guarantee net increases in external contributions (Yue and Nagle,, 2024).

This review has provided a theoretical foundation for examining commercial incentives in AI democratisation efforts. The taxonomy of social, economic, and technological incentives at individual and organisational levels (Bonaccorsi and Rossi,, 2006) offers a valuable framework for this study’s exploratory aims. By applying this framework to AI OSS donations, this study aims to identify and categorise the commercial interests driving AI democratisation. The following section outlines the research aims and methodological approach in more detail.

3 Study Design

3.1 Research Aims

The objective of this study is to identify and categorise commercial interests for AI democratisation and thus contribute to advancing the nascent research agenda on the political economy of open source AI (Widder et al.,, 2023; Srnicek,, 2022; Liesenfeld and Dingemanse,, 2024). Specifically, it examines the following research question (RQ): Why do companies democratise AI? Given the various methods of AI democratisation (Seger et al., 2023b, ), it focuses on AI OSS donations to foundations—that is, the transfer of an OSS project from a company’s ownership to a non-profit foundation (O’Mahony and Ferraro,, 2007)—which in the AI industry are commonly presented as acts of AI democratisation. While this narrow scope enables an in-depth analysis of one method of AI democratisation, it naturally limits the generalisability of the findings to others, such as the increasingly common releases of open models (see Section 5.3).

Within this scope, a mixed-methods approach was employed to investigate the incentives for 43 OSS donations between May 2018 and October 2022 by a range of companies, from startups to multinational corporations, to the LF AI & Data Foundation and PyTorch Foundation, two foundations under the LF that host OSS for data science and AI. The range of projects and companies form a diverse sample (see Table LABEL:tab:donations-projects), accounting for various project maturity levels, company sizes, sectors, and countries (Runeson and Höst,, 2008; Easterbrook et al.,, 2008). Furthermore, the mixed-methods approach to multiple cases mitigates the uniqueness of single cases (Herriott and Firestone,, 1983; Eisenhardt,, 1989) and data sources or methods (Yin,, 2018; Lehdonvirta et al.,, 2019), thus enhancing the validity of the findings. The study received ethical clearance by the University of Oxford CUREC review board prior to data collection.

3.2 Case Presentation

3.2.1 LF AI & Data Foundation

The LF AI & Data Foundation was founded in March 2018 as the LF Deep Learning Foundation and rebranded as the LF AI Foundation in May 2019, broadening its scope to encompass various AI sub-fields. In October 2020, it merged with the ODPi, an organisation promoting a big data software ecosystem. The foundation subsequently rebranded as the LF AI & Data Foundation, acknowledging the vital role of data in AI R&D. At the point of data collection (October 2022), the foundation had 51 member companies from North America, Europe, and East Asia. It hosted 42 OSS projects that had been donated by diverse organisations, including startups (e.g., AI Squared), research institutes (e.g., the Beijing Academy of AI), consortia (e.g. ONNX), management consultancies (e.g., McKinsey & Co.), and technology giants (e.g., IBM, Samsung, and Tencent). When a company seeks to donate an OSS project to the LF AI & Data Foundation, they must be a member organisation of the LF or be endorsed by a member and submit their proposal for review by the technical advisory council (TAC). The TAC comprises representatives from the various projects at the foundation and premier member companies, who vote on the approval of donation proposals. The LF AI & Data Foundation segregates business and technical governance of hosted OSS projects, ensuring that developers retain technical control in their projects whilst the foundation assumes responsibility for funding, marketing, and license compliance, among others, and enforces open community governance (Dolan,, 2023).

3.2.2 PyTorch Foundation

The PyTorch Foundation was established in September 2022 to host the popular PyTorch deep learning framework that had been donated by Meta (PyTorch,, 2023). Its mission is to “driv[e] the adoption of AI tooling by fostering and sustaining an ecosystem of open source, vendor-neutral projects integrated with PyTorch” and “to democratise state-of-the-art tools, libraries, and other components to make these innovations accessible to everyone” (PyTorch,, 2023). The PyTorch Foundation similarly maintains a separation between business and technical governance for the PyTorch project and wider ecosystem, with the PyTorch project retaining its technical governance structure while the foundation is responsible for funding, hosting expenses, and events, among others. The PyTorch Foundation manages the project’s assets, including its website, GitHub repository, and social media accounts, and enforces open community governance. Upon its launch, it formed a governing board comprising representatives from its initial members: AMD, Amazon Web Services, Google Cloud, Hugging Face, IBM, Intel, Meta, Microsoft, and Nvidia (PyTorch,, 2023). The governing board members shape PyTorch’s strategic direction through voting rights, contribute to the project’s technical development and roadmap, and gain benefits such as early feature access and increased visibility in the PyTorch ecosystem, while being expected to actively support and promote PyTorch’s growth and adoption.

3.3 Data & methods

3.3.1 Data Collection

This study comprises two sources of primary data and two sourcs of secondary data. First, two types of secondary data were collected from the Internet for 43 AI OSS donations to the LF (42 donations to the LF AI & Data Foundation and 1 donation to the PyTorch Foundation). First, pre-donation technical pitches to the TAC were collected from the LF AI & Data Foundation wiki page (LFAI&Data,, 2023). Second, post-donation blog posts by the LF and respective companies were collected from the LF AI & Data Foundation website and through Google search queries in the format of “[company name] + [project name] + [LF AI & Data Foundation]. This yielded 40 presentations (95%) and 37 (88%) blog posts for the 42 projects donated to the LF AI & Data Foundation. It was not possible to collect a pre-donation technical pitch for the PyTorch donation because, as an inaugural project of its namesake foundation, it did not follow this process. Blog posts were accessed via the PyTorch Foundation website and Google search queries using the aforementioned format. A full list of the 43 OSS projects, donors, and respective document links is provided in Table LABEL:tab:donations-projects.

Subsequently, primary data was collected through questionnaires and 12 semi-structured interviews with ten project maintainers who had donated the project and two foundation employees. First, a brief questionnaire was distributed to the maintainers to gather information on the donation process, incentives, and outcomes, as well as to recruit interviewees. It was distributed via the LF AI & Data Foundation’s mailing list and to the PyTorch maintainers via the Executive Director of the PyTorch Foundation, resulting in 16 responses from the maintainers of 16 projects at the LF AI & Data Foundation and 0 responses from the maintainers of PyTorch (in total, 37% of the 43 projects). Ten maintainers were recruited for interviews through the questionnaire, who worked for 9 companies, diverse by geography, size, and sector (see Table 3). In addition, the Executive Director of the foundations (N.B. same person) and a LF AI & Data Foundation project coordinator were interviewed. The 12 interviews lasted between 30 and 60 minutes and were semi-structured, combining standardised questions about the donation process with tailored questions based on their questionnaire responses, adding depth to the quantitative findings in Figure 1. The interviews were conducted digitally, and were recorded to aid analysis and to enhance the validity of the research findings (Yin,, 2018).

3.3.2 Data Analysis

Thematic analysis was applied to systematically identify commonalities, patterns, and relationships in the qualitative data (Cruzes and Dyba,, 2011). A systematic six-step procedure was followed to enhance the reliability of this analysis (Braun and Clarke,, 2006). The initial coding procedure involved an integrated approach, combining the inductive coding (Charmaz,, 2006) and a deductive coding informed by prior work on incentives that exist at the levels of individual developers (see Table 1) and organisations (see Table 2). This approach allowed for the identification of both commonalities with prior work and novel findings concerning open source AI democratisation efforts. The coding was conducted by the author until reaching saturation (Charmaz,, 2006), then merged codes into 25 distinct themes (i.e., social, economic, and technological incentives at the level of individual developers and companies shown in Table 4). To address the limitation of single-author analysis, each step was rigorously documented, and the results were member-checked with the interviewees and discussed with two academic advisers (Edwards and Holland,, 2013; King,, 2009). Furthermore, the interviewees were invited to review the quotes attributed to their anonymised IDs and to state their attribution preference, ensuring consent for the inclusion of statements. Only three interviewees proposed revisions (e.g., to enhance specificity), indicating the resonance of the findings with the practitioners.

Table 3: List of Interviewees & Affiliations
ID Project Role Company Size Sector
A BeyondML Maintainer Small Information technology
B Elyra Maintainer Large Information technology
C Elyra Maintainer Large Information technology
D Kedro Maintainer Large Professional services
E Kedro Maintainer Large Professional services
F KServe Maintainer Large Information technology
G Ludwig Maintainer Small Information technology
H NNstreamer Maintainer Large Information technology
I ONNX-MLIR Maintainer Large Information technology
J ONNX-MLIR Maintainer Medium Information technology
K LF AI & Data Project coordinator Medium Non-profit foundation
L LF AI & Data,
PyTorch Foundation
Executive director Medium Non-profit foundation

3.3.3 Reflexivity

In social science research, it is critical that one as a researcher engages in critical self-evaluation of the one’s positionality and disiplinary conventions, and how they may influence one’s research, from its initial design through to the reporting of and presentation of research findings (Finlay,, 2002). The exercise of reflexivity was particularly important for the credibility of this study, given the author’s affiliation with the LF as a research contractor. A social identity map was employed as a tool to encourage reflection on positionality and to address potential biases in three areas (Jacobson and Mustafa,, 2019). First, at the outset, it was used to consider the effects of the LF affiliation on data access and potential threats to reproducibility, recognising that the LF affiliation likely increased the willingness of foundation staff and project maintainers to participate in the study. To enhance reproducibility, the data collection strategy relied primarily on publicly available information (e.g., mailing lists) and information sheets sent with invitations explicitly stated the independent purpose and funding for the research. Second, to minimise social desirability bias in interview responses, the research’s independent purpose and funding mentioned in the information sheet were explained to interviewees at the beginning of each interview. Third, a journal was maintained during the thematic analysis to document coding choices, and the findings were shared with two academic advisers to review the interpretations. While these actions contribute to enhancing the credibility and reliability of the research process, it must be acknowledged that it is difficult, if not impossible, to perform bias-free research and, therefore, it should be understood as an imperfect, yet best-effort attempt by the author to control for and mitigate potential biases in the research process (Jacobson and Mustafa,, 2019).

4 Results

The findings reveal an interplay of social, economic, and technological incentives at both individual and organisational levels for OSS donations to foundations (see Table 4). This section presents these findings, contributing a more nuanced understanding of the bottom-up and top-down incentives behind AI democratisation efforts by companies.

4.1 Individual-level Incentives

Individual employees often play a crucial role in championing and coordinating the donation process within their companies. 38% of questionnaire respondents stated that the decision to donate was driven from the bottom up by developers, while 13% of respondents stated the decision was made by individuals who held both developer and managerial roles (e.g., startup founders). These findings underscore the importance of understanding individual incentives in shaping commercial decisions to donate OSS projects to foundations.

4.1.1 Social Incentives

At the individual level, social incentives are as a significant driver for OSS donations. Two key themes stood out: reciprocity and personal reputation. Many developers expressed a deep-seated ethos of ‘giving back” to the community that supports their work. For instance, Respondent A (BeyondML) stated:

The vast majority of proprietary models and software in data science and machine learning are built on open source, so being part of that and contributing to that is really important to me personally and to our company.

This sentiment was echoed by Respondent H (NNStreamer), who noted that their donation was “just for our own satisfaction” as OSS users and developers. These responses highlight the personal investment many developers have in the OSS ecosystem and their desire to contribute to its growth and sustainability. Personal reputation is also a significant social incentive, with successful donations to major foundations representing an opportunity for individual developers to enhance their standing in the OSS community. For example, Respondent J (ONNX-MLIR) explained that having a project accepted by a major foundation provides credibility among peers and brings developers like himself closer to their “dream of having your big open source project with 1000s of stars.” Respondent I (ONNX-MLIR) added that improving one’s reputation also leads to career benefits, as one can be hired based on one’s reputation, and that individuals’ aspirations align with company’s goals to improve their corporate reputation as an OSS-friendly workplace.

4.1.2 Economic Incentives

Two primary economic incentives at the individual level are career benefits and access to foundation support services.The reputational gains from OSS contributions often translate into tangible career benefits. For example, respondent I (ONNX-MLIR) noted that achievements in OSS projects make developers more competitive in the AI job market, as their expertise at the intersection of software engineering and AI becomes both known and knowable in the wider open source AI ecosystem. This enhanced visibility and credibility in the job market creates a powerful incentive for individuals to support OSS donations. As mentioned above, Respondent I (ONNX-MLIR) pointed out that these incentives of individual developers also align with organisational goals of fostering a skilled workforce, attracting and recruiting competitive AI talent, and improving their corporate reputation among OSS developers.

The desire to harness foundation support services is another important economic incentive. Many developers viewed foundation support as a means to address challenges faced by maintainers in managing projects alongside their full-time employment. 87% of respondents claimed this support was important for them (see Figure 1). Respondent F (KServe) elaborated on this point:

We, as developers, don’t have a lot of time for [outreach and marketing]. We sought to benefit from support services, such as outreach to new contributors and marketing.

Similarly, Respondent H (NNStreamer) sought marketing support to increase project visibility and to attract external developers. These examples illustrate how foundations are perceived to provide resources that individual maintainers often lack or cannot secure within their own companies, which in turn helps to enhance the growth of their OSS projects.

4.1.3 Technological Incentives

Technological incentives at the individual level include ensuring project sustainability and enabling the use of collaboration tools. Project sustainability is a significant concern for maintainers. Respondent G (Ludwig) provided a compelling example, describing how he donated his project to ensure its survival following organisational restructuring and his personal departure from the company. He viewed the transition to a foundation as an effective strategy to ensure the project’s continuation. This case demonstrates how personal attachment to projects, which he described as his “baby”, can drive individuals to seek sustainable governance models for their OSS projects, especially when their affiliation with the company comes to an end. It also highlights the use of OSS donations as a mechanism to preserve source code that might otherwise be lost due to corporate changes or neglect.

The ability to use preferred collaboration tools is another technological incentive. Respondents D and E (Kedro) explained that transitioning the project to the LF AI & Data Foundation made it easier to use tools like Slack and Discord, which were either forbidden or difficult to get approval for at their company. As Respondent E (Kedro) explained, “It untied our hands from our own bureaucracy.” They elaborated that this freedom from corporate constraints concerning collaboration tools not only enhances developer productivity and satisfaction but also aligns with broader organisational goals of fostering innovation and efficiency, thus representing a win-win scenario for them.

4.2 Organisational-level Incentives

While individual employees play a crucial role in championing OSS donations within their companies, organisational incentives (i.e., corporate strategies) ultimately matter above all in the decision-making process, with 44% of respondents stating that the donation was a top-down decision by managers and 13% respondents stating that the decision was made by individuals who held both developer and managerial roles within their company.

4.2.1 Social Incentives

At the organisational level, three primary social incentives emerged: adopting open governance, reciprocating to the OSS ecosystem, and building the company’s reputation. The adoption of open governance upon donating a project to a foundation is a salient incentive, with 81% of respondents reporting it as important. This change in governance model is viewed as a structural enabler for downstream goals. For example, Detakin’s press release highlighted this incentive:

The LF AI & Data provides a vendor-neutral governance structure that can help the project grow broad industry collaboration. Even more importantly, becoming a LF AI & Data project ensures that OpenLineage can never belong to a company.

Similarly, Lyft emphasised the importance of a “neutral holding ground” when donating both Amundsen and Flyte. These statements underscore the perceived value of neutral governance in fostering collaboration and ensuring the independence of projects.

Similar to individuals, reciprocity to the OSS ecosystem is a key social incentive at the organisational level. Several respondents cited the critical importance of OSS dependencies in their companies’ proprietary products and services, and perceived the donation of their OSS project as one way to “give back”, as explained by Respondent A (BeyondML). In a similar vein, Respondent C (Elyra) underscored the impact of OSS for advances in AI as a reason for why he champions giving back to the ecosystem:

The democratisation of AI software is really what is helping industry advance. If you look back like 10-20 years ago, it was very hard and you needed to have a specific set of skills to be able to even build a very simple model. Today with all the tools and stuff, it’s much easier.

Respondent C (Elyra) explained that their team’s decision to donate Elyra stemmed in part from their desire to play their part in advancing industry. This sentiment was echoed in several companies’ post-donation press releases, framing their donations as acts of AI democratisation. For instance, Uber stated that by donating Pyro, it hoped “to facilitate greater opportunities for researchers worldwide and [to] make deep learning and Bayesian modelling more accessible.”

Building corporate reputation emerged as the third significant social incentive at the organisational level. Many respondents explained they hoped that the reputation of the LF AI & Data would enhance their company’s credibility in the open source AI ecosystem, with 75% of respondents reporting it as important. This incentive was particularly strong for startups and companies without an established reputation in AI. Respondent A (BeyondML) explained:

One of the things that we hoped to get from the LF is its name recognition, obviously just about every developer in the world knows about the LF or knows the term Linux. So having that kind of badge, if you will, immediately gives you a level of credibility with your project.

Respondents D and E (Kedro) echoed this sentiment, stating in their pre-donation pitch that they “would like to leverage the initial marketing announcements to build credibility in their technical and product-related capability”. Subsequently, in the post-donation press release, their company stated:

It’s a substantial step forward for our organisation on our open source journey. The consultancy sector has traditionally been highly protective of intellectual property, but it’s clear that open, collaborative innovation will help power the next phase for analytics technology.

These findings highlight three key social incentives driving organisational decisions to donate AI OSS projects. The adoption of open governance emerges as a key structural enabler for fostering collaboration and ensuring project independence. Reciprocity to the OSS ecosystem reflects companies’ recognition of their dependence on open source technologies and their desire to contribute to industry advancement. Additionally, the opportunity to enhance corporate reputation, particularly for less established companies, serves as a salient motivator. These social incentives are strategically important for companies seeking to position themselves favourably within the open source AI ecosystem.

4.2.2 Economic Incentives

Several economic incentives at the organisational level inform the decision to donate OSS projects, including attracting external contributors, reducing development costs, diversifying project funding, and harnessing foundation support services. Indeed, being able to attract new contributors to their project is a critical incentive for OSS donations, with 100% respondents reporting it as important. Respondent J (ONNX-MLIR) described OSS donations as a strategic trade-off, where companies exchange full control of their OSS project for the aspired for benefits of distributed development involving a community of contributors. He noted the self-interested logic:

We continue doing the same work as we would if it wasn’t an open source project, but there’s this expectation that we’re going to benefit from a community helping us achieve our own goals.

Respondent G (Ludwig) explained that above all it is beneficial for attracting contributors from other companies, who “prefer not to contribute to projects that are started by companies that could be competitors. They don’t trust it as fully open source if it was started by Uber, Google, Facebook, or whatever company.” Meanwhile, Respondent B (Elyra) provided a more nuanced view:

In reality, in all the projects that I’ve seen, they are still driven by the main inventors. Open governance just means that the feedback comes from additional contributors … It’s more like a community project.

However, respondents cautioned that changing the governance model does not guarantee more or useful external contributions. Respondent D (Kedro) explained she had rejected pull requests “because they were not up to scratch”, while Respondent G (Ludwig) discussed the role of mentorship in training external developers to become effective contributors, which he highlighted requires the company to invest time and money.

Diversification of funding is another significant economic incentive. For example, respondent E (Kedro) discussed the relevance of client concerns about the financial dependence on the host company:

Clients would ask, ’What will happen if your team does not exist tomorrow?’ They were afraid that if we left the code, they wouldn’t be able to get new versions and then it would become unmaintained.

Respondent D (Kedro) speculated that the only reason they were able to convince their senior management to approve the open-sourcing of Kedro was due to this client pressure. In addition, some companies seek to attract or increase external investment in their project. Several respondents saw this as a primary reason for Meta’s donation of PyTorch, speculating that members of the newly founded governing board of the PyTorch Foundation would likely invest more resources in PyTorch than they did before.

Harnessing foundation support services also emerged as a significant economic incentive at the organisational level, with 87% of respondents reporting it as important. For example, Respondent A (BeyondML) described the resources and infrastructure provided by foundations as “stability offerings” that could help scale and sustain projects beyond the means of their startup, thus making OSS donations an attractive option for sustainable project growth.

4.2.3 Technological Incentives

At the organisational level, several technological incentives were identified, including ecosystem integration and adoption, software improvements, faster innovation, and influencing industry standards. Respondents highlighted that joining a foundation offered ecosystem benefits. For example, respondent C (Elyra) noted:

Together with [open governance], we thought being in an ecosystem of other machine learning and AI projects would foster collaboration and integration, exposing Elyra to more use cases.

This perspective illustrates how companies view foundation ecosystems as platforms for enhanced collaboration and project visibility, potentially leading to more adoption and more diverse applications of their software. This was confirmed by 88% of respondents who reported increasing adoption as an important incentive for their company.

Software improvements and faster innovation were also identified as significant technological incentives. Respondent B (Elyra) observed that “That’s something which is one strength of open source; it’s much more properly tested than [proprietary] software.” An additional incentive is the aim to accelerate development, implicitly thanks to new contributors who would join the project, with 69% of respondents reporting it as important. This underlines the aforementioned perspective of Respondent J (ONNX-MLIR) that AI democratisation is in large part self-interested, using open source as a means to reduce internal costs, speed up innovation, and enhance software quality.

Influencing industry standards emerged as another key technological incentive, with 63% of respondents reporting it as important for their company. A number of respondents speculated that the ambition behind Meta’s donation of PyTorch was to make it the open standard for deep learning, bringing an end to the age-old rivalry between PyTorch and Google’s TensorFlow. Respondent E (Kedro) commented:

I think that move was actually made to destroy TensorFlow because TensorFlow [does not have] open governance.

Respondent D (Kedro) suggested that Meta was seeking to beat TensorFlow out of the market since Google would struggle to compete with the strategic alliance of industry giants that had formed under the PyTorch Foundation’s governing board (which even includes Google). Respondent B (Elyra) was more direct, describing it as “a death knell to TensorFlow.” Reflecting on their experience of overseeing donations to the foundation, Respondent L (LF) explained:

Every company may have a different set of incentives but what’s common across all of them is the desire to make sure that the project becomes successful in the long term and becomes the de facto project for its given functionality.

These findings reveal a complex interplay of social, economic, and technological incentives driving companies to donate AI OSS to foundations. The incentives span both individual and organisational levels, highlighting the multifaceted nature of AI democratisation efforts. These insights provide a nuanced understanding of the strategies and considerations that shape commercial decisions, practices, and strategies in the open source AI ecosystem.

Refer to caption
Figure 1: Commercial Incentives for AI OSS Donations to the Linux Foundation

Sample: 16/43 (37%) OSS projects hosted at LF AI & Data Foundation and PyTorch Foundation in October 2022.

5 Discussion

5.1 Implications for Research

5.1.1 Disambiguating “AI Democratisation” Narratives

The findings shed light on the inconsistent use of “AI democratisation” in commercial narratives surrounding OSS donations, extending the work of Seger et al., 2023b . They show that in the context of OSS donations, the most relevant interpretation is the democratisation of governance, as companies transfer already open-sourced software from their single-vendor governance to open governance provided by vendor-neutral foundations. This transition of governance and control rights, however, is not an end in itself but a strategy that aims to realise goals, such as increasing adoption and recruiting new contributors to the project. It is often the companies seek, in particular, participation by other companies, who previously would hesitate to contribute to a company’s OSS project, confirming prior findings on “hold-up” problems (Yue and Nagle,, 2024). These findings challenge the notion that AI democratisation efforts are primarily altruistic. OSS donations are a strategic calculation, where companies relinquish control of their OSS project in exchange for assumed benefits. This perspective aligns with and extends the work of Widder et al. Widder et al., (2023) and Srnicek Srnicek, (2022), who argue that open source practices in AI serve commercial interest. This should not be surprising, as for-profit companies are not charities, but it provides an empirical evidence base to illustrate that “there is no such thing as a free lunch” and to understand the possible objectives behind such AI democratisation efforts. Moreover, the analysis reveals the role of individual developers who advocate for OSS donations within their companies, demonstrating that donations result from an interplay between both individual and organisational incentives.

5.1.2 A Taxonomy for Understanding AI Democratisation Incentives

The findings can be summarised in a taxonomy of commercial incentives for AI OSS donations (see Table 4). Drawing on the taxonomy by Bonaccorsi and Rossi (2006, see Tables 1-2), this taxonomy categorises incentives at both individual and organisational levels, grouped into social, economic, and technological categories. It provides a framework for understanding the intersecting bottom-up and top-down incentives behind OSS donations, demonstrating that they are driven by a combination of individual and organisational factors. The value of this taxonomy lies in its ability to provide a structured approach to analysing the incentives for AI democratisation efforts, offering researchers and practitioners a tool to systematically examine the incentives at play in different scenarios.

The findings corroborate prior work on the salience of economic and technological incentives for companies (Bonaccorsi and Rossi,, 2006), with the democratisation of governance in large part used as a means to various economic and technological ends, such as recruiting external developers (Agerfalk and Fitzgerald,, 2008; Fink,, 2003), reducing development costs (Birkinbine,, 2020; Chesbrough,, 2023), and influencing industry standards (Lerner,, 2002; Lindman et al.,, 2009). According to the questionnaire, the most important incentive for companies is to recruit new contributors to the project, who can help improve the quality and competitiveness of the software, ultimately serving the interests of the donor company. As Respondent J (ONNX-MLIR) noted, “there’s an expectation that we’re going to benefit from a community helping us achieve our own goals.” These findings corroborate statements by Big Tech companies about their open source AI strategies, who want to “own the ecosystem and let open source work for us” (Patel and Ahmad,, 2023) and for “everyone to be using [LLaMA] because the more people who are using it, the more the flywheel will spin” (South Park Commons,, 2024). This aligns with ethical concerns about commercial exploitation of volunteer work in OSS without adequately reciprocating to the OSS ecosystem (Li et al.,, 2022; Birkinbine,, 2020).

However, the findings also add nuance to this perspective. While the promise of being able to attract new contributors to their project is indeed important for many companies, it is not universally paramount. For some companies, particularly those with substantial resources or an existing community, other incentives such as standard-setting take precedence (Lerner,, 2002; Lindman et al.,, 2009; Widder et al.,, 2023). The case of PyTorch, which was donated by Meta and already had a large contributor community, exemplifies this scenario. Even for companies primarily seeking to attract new contributors, the findings show that these benefits are far from guaranteed. The interviews identified two key challenges. First, the mere act of donation does not ensure that a project will attract new contributors. Second, even if a project can recruit new contributors, the quality of their contributions may not meet the project’s standards. Companies reject external contributions that “are not up to scratch” (Respondent D, Kedro) and need to invest in mentoring and training resources for external contributors (Respondent G, Ludwig). Companies may need to invest considerable time and resources into community building and contributor development, challenging the notion of open source as a straightforward cost-saving measure. This finding provides important nuance to the “free labour” thesis, and provides avenues for future research into the real costs and benefits of spinning-out or donating OSS projects.

It is also worth noting that the relative importance of these incentives vary depending on the context, company size, and type of the AI technology being “democratised” (e.g., OSS, AI model, or other). While this taxonomy provides a starting point for understanding commercial incentives for AI democratisation, it should be used as a flexible framework rather than a definitive list. Future research should build on it by testing its applicability to different approaches to AI democratisation, such as AI model and dataset releases (Seger et al., 2023b, ; Seger et al., 2023a, ; Don-Yehiya et al.,, 2024). On the one hand, incentives such as building a company’s reputation, increasing adoption, recruiting new contributors, reducing development costs, and shaping industry standards likely apply to AI model releases. On the other hand, the adoption of open governance is, at least to this day, specific to the context of OSS donations to foundations. To date, while companies have released models, they have not transferred them to a vendor-neutral foundation for open governance. This may in part be due to the lack of established practices for openly governing AI models given their relative complexity compared to OSS in terms of the resources required to develop and maintain AI models and their constituent components (White et al.,, 2024), or in part because the incentives may be more similar to those for spinning-out software, such as increasing adoption (West and O’Mahony,, 2005). At this stage, we can only speculate and, therefore, the commercial incentives and strategies for AI model releases, including governance choices, as well as their implications for norms and practices in the open source AI ecosystem, warrant further investigation.

Table 4: Commercial Incentives for AI OSS Donations
Category Individual Incentives Organisational Incentives
Social • Personal interests in AI and OSS
• Reciprocate to OSS ecosystem
• Build personal reputation
• Transition to open governance
• Reciprocate to community/ecosystem
• Build company’s reputation
• Join community of companies
Economic • Professional and career benefits
• Harness foundation services
• Recruit new contributors to project
• Reduce development costs
• Diversify project funding
• Harness foundation services
• Recruit talent to join company
• Increase competitive advantage
Technological • Ensure project sustainability
• Increase visibility and adoption
• Contribute to open source AI stack
• Access to new collaboration tools
• Increase visibility and adoption
• Accelerate innovation
• Improve software quality
• Foster technical interoperability
• Shape industry standards

5.1.3 From Intentions to Outcomes: Evaluating the Long-term Impact of OSS Donations

The findings capture the stated incentives of companies at a particular point in time. However, the actual realisation of these incentives and goals may vary and would require longitudinal study to verify. For instance, the study by Yue and Nagle, (2024) on PyTorch’s governance transition provides a compelling case study that both supports and challenges some of the incentives identified in this research. Their finding that the governance change led to increased participation from complementors (e.g., chip manufacturers) aligns with the incentive of recruiting new contributors identified in this study. However, the decrease in contributions from Meta following the transition suggests that the realisation of incentives may be more complex than initially anticipated. This complexity is further underscored by the fact that users (e.g., app developers and cloud providers) did not change their rate of participation, indicating that different stakeholders may respond differently to governance changes. These insights call for more longitudinal studies that track the outcomes of OSS donations over time. Such research could help verify whether long-term economic and technological benefits are indeed realised. Furthermore, it could shed light on how the balance of contributions shifts between the donor company and external contributors following a governance change. The PyTorch case also highlights the importance of considering the specific type of external contributors when evaluating the success of OSS donations. The differential response between complementors and users suggests that future research should delve deeper into how different types of stakeholders perceive and respond to changes in project governance. Future research could explore how companies balance the potential benefits of increased external contributions against the possible reduction in their own incentives to invest in the project. While this study provides a comprehensive taxonomy of incentives for OSS donations, the emerging evidence on the impacts of such donations highlights the need for further research.

5.2 Implications for Practice

The findings have several important implications for practitioners in open source AI development and governance. First, the findings underscore the need for greater transparency in corporate communications about AI democratisation efforts. The inconsistent use of “AI democratisation” underlines that practitioners should be more specific and transparent about their goals, as suggested by Seger et al., 2023b . Companies could clearly communicate the specific aspects of AI they aim to democratise (e.g., access, development, or governance) and explain how their actions contribute to these goals. Furthermore, greater transparency about the benefits that companies seek to gain from their democratisation efforts could foster more trust in corporate-community relations. This transparency could lead to more realistic expectations from the open source community and potentially more sustainable collaborations.

Second, the taxonomy provides a toolkit for researchers and practitioners to understand the various social, economic, and technological incentives that may be driving AI democratisation efforts. It is also useful in shedding light on relevant incentives at both the individual developer level and the organisational level, revealing that both bottom-up and top-down incentives may be at play when companies announce they are democratising AI. Furthermore, practitioners can use this toolkit to examine how well commercial incentives align with the broader interests of the open source AI community, which may help companies and volunteers identify areas of synergy and enable more ethical collaboration.

Third, the findings highlight the importance of foundations that facilitate collaboration between diverse stakeholders. While the benefits of foundations for developing and governing OSS are by now well understood, to this day companies are not transferring AI models to foundations for open governance. As discussed above, this may in part be due to the lack of established practices for openly governing AI models given their relative complexity in terms of the resources required to develop and maintain AI models and their various constituent components (White et al.,, 2024). A question for practitioners to explore is whether, and if so how, the foundation-hosting and open governance model could be adopted for the hosting and governance of AI models. To answer this question, AI companies should engage foundations, not just as potential hosts of models, but as active partners in shaping the future of open source AI development.

By employing the toolkit and implementing these practices, practitioners can work towards a more transparent and sustainable approach to AI democratisation that acknowledges commercial interests while fostering the broader goals of open and collaborative AI development. This approach could lead to more ethical and effective AI technologies that better serve societal needs while still allowing for innovation and commercial success.

5.3 Threats to Validity

The validity of the findings are evaluated with reference to guidance for case study research and qualitative research methods in software engineering research (Runeson and Höst,, 2008; Yin,, 2018; Easterbrook et al.,, 2008).

5.3.1 Credibility

Credibility refers to the believability of the findings (Easterbrook et al.,, 2008). The primary threat stems from the author’s affiliation with the LF as a research contractor, creating potential conflicts of interest and research biases, despite the independent purpose and funding of this research study. As explained in Section 3.3.3, a social identity map was used to identify and address potential biases in three areas: data access, collection, and analysis (Jacobson and Mustafa,, 2019). As previously stated, while a number of step were taken to enhance the credibility of the research process, it is acknowledged that there is nonetheless a risk of biases that were not controlled for or mitigated and as such it should be understood as an imperfect but best-effort attempt by the author. Another challenge concerned gaining access to data about commercial strategies. The working solution involved drawing on four sources—pre-donation technical pitches, post-donation press releases, questionnaires, and interviews—which offered a triangulated account of incentives (Yin,, 2018). While this approach provided four perspectives, it was inevitably constrained by the limited participation of company-affiliated developers, who were willing or allowed to participate in a research interview, and by the fact that some strategic incentives may never be shared publicly. In addition, the willingness of maintainers to complete the questionnaire and/or participate in an interview may have been affected by company policies (e.g., NDAs).

5.3.2 Robustness

Robustness concerns strength, reliability, and soundness of the study’s design, methods, and findings. There was a risk of social desirability bias or response bias in the interviews, owing to the author’s affiliation with the LF. Steps were taken to minimise the risk of these biases, such as proactively communicating the independent purpose and funding of this study in the interview invitations and at the beginning of every interview. Another threat stems from the thematic analysis. The author sought to maximise the robustness of this analysis by following best-practice guidelines (Braun and Clarke,, 2006) and integrating approaches to coding qualitative data (Cruzes and Dyba,, 2011). In addition, the structure of the taxonomy poses risks to the validity of the findings. In some cases, it was difficult to demarcate incentives at the two units of analysis, which may have led to a misclassification or oversimplification. That being said, efforts were made to develop the categories by thoroughly reviewing prior work and member-checking findings with interviewees to evaluate their accuracy and resonance with practitioners (Lincoln and Guba,, 1985).

5.3.3 Transferability

Transferability concerns the generalisability of qualitative research findings. There are two key threats to generalisability. First, the narrow focus on OSS donations as a method of AI democratisation limits the generalisability of the taxonomy, which both includes aspects specific to OSS donations (e.g., democratising governance or foundation support) and excludes aspects that may be typical of other methods of AI democratisation (e.g., open model releases). Second, this study may be limited by the specific characteristics of the sample of companies, OSS projects, and the foundations. The sample was drawn from the LF AI & Data Foundation and PyTorch Foundation, two foundations under the LF, which may not fully represent other foundations. Future research directions were recommended in Section 5.1 to advance our understanding of the commercial incentives for different approaches to AI democratisation.

5.3.4 Dependability

Dependability refers to the consistency of the research process. A comprehensive list of secondary documents collected for each OSS project is provided in Table LABEL:tab:donations-projects. These documents were triangulated by questionnaires and interviews with maintainers and staff from the foundations. With their consent, all interviews were recorded and transcribed to aid the analysis (Yin,, 2018). With regards to the data analysis, common guidelines were followed for the systematic analysis of qualitative data (Braun and Clarke,, 2006; Cruzes and Dyba,, 2011). Additionally, the accuracy of the analysis was validated through member-checking findings with interviewees, ensuring their resonance with practitioners.

6 Conclusion

Companies are “democratising” AI through various approach and with various goals. While these efforts tend to be celebrated for facilitating science and innovation, the strategic incentives driving these apparently altruistic acts are in large part hidden from public view. However, as the development and application of AI technologies has an ever-increasing impact on society and the economy, it becomes imperative to better understand the incentives and strategies for AI democratisation as well as whose interests AI democratisation ultimately serves. Towards this end, this study employed a mixed-methods approach to investigate the commercial incentives for 43 AI OSS donations to the LF. The findings highlight an interplay of social, economic, and technological incentives at both individual and organisational levels. In the case of OSS donations, the democratisation of governance is treated as a social means for primarily economic and technological ends, such as attracting external contributors, reducing development costs, and influencing industry standards, among others. Furthermore, the study illustrates the role of individual developers, who champion OSS donations within their companies, thus highlighting the relevance of the bottom-up incentives for AI democratisation. While some incentives are unique to OSS donations, the taxonomy serves both as a theoretical foundation and as a practical toolkit for understanding the commercial incentives for related AI democratisation efforts, such as AI model releases. As the number of open models continues to grow at a rapid pace, it is timely for researchers to turn their attention to the commercial incentives and strategies driving these AI democratisation efforts, as well as the effects thereof on the norms, practices, and potential trajectories of the open source AI ecosystem at large.

Acknowledgements

The author thanks all research subjects for their participation in this study.

Declarations

Funding

This study was funded by the UK Economic and Social Research Council (Grant Number: ES/P000649/1).

Competing interests

The author was affiliated with the LF as a research contractor during the research period. However, the study was independently funded by the UK Economic and Social Research Council (Grant Number: ES/P000649/1).

Ethics approval

This study obtained ethical clearance from University of Oxford’s Research Ethics Committee prior to data collection.

Consent

As per the ethics approval requirements at the University of Oxford, all research subjects were asked to provide informed consent prior to data collection and were informed of their rights to withdraw their data at any stage. In addition, research subjects confirmed the affiliations and quotes included in this paper prior to its submission for publication.

Data and/or Code availability

N/A

Authors’ contributions

All contributions were made by the author and any errors made are his.

References

  • Agerfalk and Fitzgerald, (2008) Agerfalk, P. J. and Fitzgerald, B. (2008). Outsourcing to an unknown workforce: exploring opensourcing as a global sourcing strategy. MIS Quarterly, 32(2):385–409.
  • Ahlawat et al., (2021) Ahlawat, P., Boyne, J., Herz, D., Schmieg, F., and Stephan, M. (2021). Why You Need an Open Source Software Strategy.
  • Ahmed et al., (2023) Ahmed, N., Wahed, M., and Thompson, N. C. (2023). The growing influence of industry in AI research. Science, 379(6635):884–886. Publisher: American Association for the Advancement of Science.
  • Akiki et al., (2022) Akiki, C., Pistilli, G., Mieskes, M., Gallé, M., Wolf, T., Ilić, S., and Jernite, Y. (2022). BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model. arXiv:2212.04960 [cs].
  • Alrawashdeh et al., (2020) Alrawashdeh, T. A., Elbes, M. W., Almomani, A., ElQirem, F., and Tamimi, A. (2020). User acceptance model of open source software: An integrated model of OSS characteristics and UTAUT. Journal of Ambient Intelligence and Humanized Computing, 11:3315–3327. ISBN: 1868-5137 Publisher: Springer.
  • Bagozzi and Dholakia, (2006) Bagozzi, R. P. and Dholakia, U. M. (2006). Open source software user communities: A study of participation in linux user groups: Open source software. Management science, 52(7):1099–1115. Place: Linthicum, MD Publisher: Institute for Operations Research and the Management Sciences.
  • Barcomb et al., (2020) Barcomb, A., Kaufmann, A., Riehle, D., Stol, K.-J., and Fitzgerald, B. (2020). Uncovering the Periphery: A Qualitative Survey of Episodic Volunteering in Free/Libre and Open Source Software Communities. IEEE transactions on software engineering, 46(9):962–980. Place: LOS ALAMITOS Publisher: IEEE.
  • Benkler, (2002) Benkler, Y. (2002). Coase’s Penguin, or, Linux and "The Nature of the Firm". The Yale Law journal, 112(3):369–446. Place: New Haven Publisher: The Yale Law Journal Company.
  • Benkler, (2006) Benkler, Y. (2006). The Wealth of Networks: How Social Production Transforms Markets and Freedom. Yale University Press, New Haven [Conn.
  • Bergquist and Ljungberg, (2001) Bergquist, M. and Ljungberg, J. (2001). The power of gifts: organizing social relationships in open source communities. Information systems journal (Oxford, England), 11(4):305–320. Place: Oxford, UK Publisher: Blackwell Science Ltd.
  • Bezroukov, (1999) Bezroukov, N. (1999). Open source software as a special type of academic research: Critique of the vulgar Raymondism. First Monday.
  • Birkinbine, (2018) Birkinbine, B. (2018). Commons Praxis: Toward a Critical Political Economy of the Digital Commons. tripleC: Communication, Capitalism & Critique. Open Access Journal for a Global Sustainable Information Society.
  • Birkinbine, (2020) Birkinbine, B. (2020). Incorporating the Digital Commons: Corporate Involvement in Free and Open Source Software. University of Westminster Press.
  • Bitzer et al., (2007) Bitzer, J., Schrettl, W., and Schröder, P. J. (2007). Intrinsic motivation in open source software development. Journal of Comparative Economics, 35(1):160–169. Place: San Diego Publisher: Elsevier Inc.
  • Bitzer and Schroder, (2006) Bitzer, J. and Schroder, P. J. H. (2006). The Economics of Open Source Software Development: An Introduction. In The Economics of Open Source Software Development. Emerald Publishing Limited, United Kingdom.
  • Bonaccorsi and Rossi, (2006) Bonaccorsi, A. and Rossi, C. (2006). Comparing motivations of individual programmers and firms to take part in the open source movement: From community to business. Knowledge, Technology & Policy, page 25.
  • Bonaccorsi and Rossi, (2003) Bonaccorsi, A. and Rossi, C. L. (2003). Altruistic Individuals, Selfish Firms? The Structure of Motivation in Open Source Software. SSRN Scholarly Paper ID 433620, Social Science Research Network, Rochester, NY.
  • Braun and Clarke, (2006) Braun, V. and Clarke, V. (2006). Using thematic analysis in psychology. Qualitative research in psychology, 3(2):77–101. Place: London Publisher: Taylor & Francis Group.
  • Broca, (2021) Broca, S. (2021). Communs et capitalisme numérique : histoire d’un antagonisme et de quelques affinités électives. Terminal. Technologie de l’information, culture & société, (130). Number: 130 Publisher: Creis-Terminal.
  • Butler et al., (2018) Butler, S., Gamalielsson, J., Lundell, B., Jonsson, P., Sjoberg, J., Mattsson, A., Ricko, N., Gustavsson, T., Feist, J., Landemoo, S., and Lonroth, E. (2018). An Investigation of Work Practices Used by Companies Making Contributions to Established OSS Projects. In 2018 IEEE/ACM 40th International Conference on Software Engineering: Software Engineering in Practice Track (ICSE-SEIP), pages 201–210. ACM.
  • Candela, (2017) Candela, J. Q. (2017). Facebook and Microsoft introduce new open ecosystem for interchangeable AI frameworks - Meta Research | Meta Research.
  • Capiluppi and Michlmayr, (2007) Capiluppi, A. and Michlmayr, M. (2007). From the cathedral to the bazaar: An empirical study of the lifecycle of volunteer community projects. In IFIP International Conference on Open Source Systems, pages 31–44. Springer.
  • Charmaz, (2006) Charmaz, K. (2006). Constructing grounded theory: A practical guide through qualitative analysis. SAGE.
  • Chesbrough, (2023) Chesbrough, H. (2023). Measuring the Economic Value of Open Source. Technical report, Linux Foundation, San Francisco, CA, USA.
  • Clegg, (2023) Clegg, N. (2023). Nick Clegg: Openness on AI is the way forward for tech. Financial Times.
  • Crowston and Scozzi, (2002) Crowston, K. and Scozzi, B. (2002). Open source software projects as virtual organisations: competency rallying for software development. IEE Proceedings-Software, 149(1):3–17. Publisher: IET.
  • Crowston and Scozzi, (2008) Crowston, K. and Scozzi, B. (2008). Bug Fixing Practices within Free/Libre Open Source Software Development Teams. Journal of database management, 19(2):1–30. Publisher: IGI Global.
  • Cruzes and Dyba, (2011) Cruzes, D. S. and Dyba, T. (2011). Recommended Steps for Thematic Synthesis in Software Engineering. In 2011 International Symposium on Empirical Software Engineering and Measurement, pages 275–284. ISSN: 1949-3789.
  • Dahlander and Magnusson, (2005) Dahlander, L. and Magnusson, M. G. (2005). Relationships between open source software companies and communities: Observations from Nordic firms. Research Policy, 34(4):481–493.
  • Dahlander and Wallin, (2006) Dahlander, L. and Wallin, M. W. (2006). A man on the inside: Unlocking communities as complementary assets. Research Policy, 35(8):1243–1259.
  • David and Shapiro, (2008) David, P. and Shapiro, J. (2008). Community-based production of open-source software: What do we know about the developers who participate? Information Economics and Policy, 20(4):364–398. Publisher: North-Holland.
  • David et al., (2003) David, P., Waterman, A., and Arora, S. (2003). FLOSS-US: The Free/Libre/Open Source Software Survey for 2003. Technical report, Stanford Institute for Economic Policy Research, Stanford, CA, USA.
  • Di Giacomo et al., (2020) Di Giacomo, D., Kudzmanaite, B., Deveny, V., Dussutour, C., and Shaikh, M. (2020). Key success factors of sustainable open source communities. Technical report, EU, Brussels, Belgium.
  • Ding et al., (2023) Ding, J., Akiki, C., Jernite, Y., Steele, A. L., and Popo, T. (2023). Towards Openness Beyond Open Access: User Journeys through 3 Open AI Collaboratives. arXiv:2301.08488 [cs].
  • Dolan, (2023) Dolan, M. (2023). How open source foundations protect the licensing integrity of open source projects.
  • Don-Yehiya et al., (2024) Don-Yehiya, S., Burtenshaw, B., Astudillo, R. F., Osborne, C., Jaiswal, M., Kuo, T.-S., Zhao, W., Shenfeld, I., Peng, A., Yurochkin, M., and others (2024). The Future of Open Human Feedback. arXiv preprint arXiv:2408.16961.
  • Easterbrook et al., (2008) Easterbrook, S., Singer, J., Storey, M.-A., and Damian, D. (2008). Selecting Empirical Methods for Software Engineering Research. In Shull, F., Singer, J., and Sjøberg, D. I. K., editors, Guide to Advanced Empirical Software Engineering, pages 285–311. Springer, London.
  • Edwards and Holland, (2013) Edwards, R. and Holland, J. (2013). How have qualitative interviews developed? In What is Qualitative Interviewing?, The ’What is?’ Research Methods Series, pages 11–28. Bloomsbury Academic, London, 1 edition.
  • Eisenhardt, (1989) Eisenhardt, K. M. (1989). Building Theories from Case Study Research. The Academy of Management review, 14(4):532–550. Place: Ada, Ohio, etc Publisher: Academy of Management.
  • Feller and Fitzgerald, (2002) Feller, J. and Fitzgerald, B. (2002). Understanding Open Source Software development. Pearson Education, London.
  • Fershtman and Gandal, (2004) Fershtman, C. and Gandal, N. (2004). The Determinants of Output Per Contributor in Open Source Projects: An Empirical Examination.
  • Fink, (2003) Fink, M. (2003). The business and economics of Linux and open source. Prentice Hall Professional.
  • Finlay, (2002) Finlay, L. (2002). “Outing” the Researcher: The Provenance, Process, and Practice of Reflexivity. Qualitative Health Research, 12(4):531–545. Publisher: SAGE Publications Inc.
  • Franck and Jungwirth, (2002) Franck, E. and Jungwirth, C. (2002). Reconciling investors and donators - The governance structure of open source. Working Papers. Number: 0008 Publisher: University of Zurich, Institute for Strategy and Business Economics (ISU).
  • Franke and Hippel, (2003) Franke, N. and Hippel, E. v. (2003). Satisfying heterogeneous user needs via innovation toolkits: the case of Apache security software. Research policy, 32(7):1199–1215. Publisher: Elsevier B.V.
  • Germonprez et al., (2013) Germonprez, M., Allen, J. P., Warner, B., Hill, J., and McClements, G. (2013). Open source communities of competitors. Interactions, 20(6):54–59.
  • Gerosa et al., (2021) Gerosa, M., Wiese, I., Trinkenreich, B., Link, G., Robles, G., Treude, C., Steinmacher, I., and Sarma, A. (2021). The Shifting Sands of Motivation: Revisiting What Drives Contributors in Open Source. In Proceedings of the 43rd International Conference on Software Engineering, ICSE ’21, pages 1046–1058, Madrid, Spain. IEEE Press.
  • Ghosh, (1998) Ghosh, R. A. (1998). Interviews with Linus Torvalds: What motivates software developers. First Monday.
  • Ghosh, (2007) Ghosh, R. A. (2007). Understanding Free Software Developers: Findings from the FLOSS Study. In Feller, J., Fitzgerald, B., Hissam, S., and Lakhani, K., editors, Perspectives on Free and Open Source Software. MIT Press, Cambridge, MA, USA.
  • Ghosh et al., (2002) Ghosh, R. A., Glott, R., Krieger, B., and Robles, G. (2002). Free/Libre and Open Source Software: Survey and Study. International Institute of Infonomics.
  • Ghosh and Schmidt, (2006) Ghosh, R. A. and Schmidt, J. P. (2006). Open source and open standards: a new Frontier for economic development? ISBN: 1814-8026 Publisher: United Nations University.
  • GitHub, (2023) GitHub (2023). Machine Learning and Artificial Intelligence repositories on GitHub.
  • Green, (2000) Green, L. (2000). Economics of Open Source Software.
  • Haddad, (2022) Haddad, I. (2022). Artificial Intelligence and Data in Open Source. Technical report, Linux Foundation.
  • Hann et al., (2002) Hann, I., Roberts, J., Slaughter, S., and Fielding, R. (2002). Economic Incentives for Participating in Open Source Software Projects. Proceedings of the International Conference on Information Systems.
  • Hars and Ou, (2002) Hars, A. and Ou, S. (2002). Working for Free? Motivations for Participating in Open-Source Projects. International journal of electronic commerce, 6(3):25–39. Publisher: Routledge.
  • Haruvy et al., (2003) Haruvy, E., Prasad, A., and Sethi, S. (2003). Harvesting Altruism in Open-Source Software Development. Journal of Optimization Theory and Applications, 118(2):381–416.
  • Hawkins, (2004) Hawkins, R. E. (2004). The economics of open source software for a competitive firm: Why give it away for free? Netnomics, 6(2):103–117. Place: Dordrecht Publisher: Springer Nature B.V.
  • Hecker, (1999) Hecker, F. (1999). Setting up shop: The business of open-source software. IEEE Software, 16(1):45–51.
  • Hemetsberger, (2002) Hemetsberger, A. (2002). Fostering Cooperation on the Internet: Social Exchange Processes in Innovative Virtual Consumer Communities. ACR North American Advances, NA-29.
  • Henkel, (2006) Henkel, J. (2006). Selective revealing in open innovation processes: The case of embedded Linux. Research policy, 35(7):953–969. Place: Amsterdam Publisher: Elsevier B.V.
  • Herriott and Firestone, (1983) Herriott, R. E. and Firestone, W. A. (1983). Multisite Qualitative Policy Research: Optimizing Description and Generalizability. Educational researcher, 12(2):14–19. Publisher: American Educational Research Association.
  • Hertel et al., (2003) Hertel, G., Niedner, S., and Herrmann, S. (2003). Motivation of software developers in Open Source projects: an Internet-based survey of contributors to the Linux kernel. Research Policy, 32(7):1159–1177.
  • Hossain, (2021) Hossain, A. (2021). Regional OSS Communities: The View From Dhaka, Bangladesh.
  • Howison and Herbsleb, (2013) Howison, J. and Herbsleb, J. D. (2013). Incentives and integration in scientific software production. In Proceedings of the 2013 conference on Computer supported cooperative work, CSCW ’13, pages 459–470, New York, NY, USA. Association for Computing Machinery.
  • Jacobson and Mustafa, (2019) Jacobson, D. and Mustafa, N. (2019). Social Identity Map: A Reflexivity Tool for Practicing Explicit Positionality in Critical Qualitative Research. International journal of qualitative methods, 18:160940691987007–. Place: Los Angeles, CA Publisher: SAGE Publications.
  • Kak and Myers West, (2023) Kak, A. and Myers West, S. (2023). AI Now 2023 Landscape: Confronting Tech Power. Technical report, AI Now Institute.
  • Kelty, (2008) Kelty, C. M. (2008). Two bits: the cultural significance of free software. Experimental futures. Duke University Press, Durham, N.C. ; London.
  • Kendall et al., (2016) Kendall, J. E., Kendall, K. E., and Germonprez, M. (2016). Game theory and open source contribution: Rationale behind corporate participation in open source software development. Journal of Organizational Computing and Electronic Commerce, 26(4):323–343.
  • King, (2009) King, N. (2009). Interviews in qualitative research. SAGE, London. Book Title: Interviews in qualitative research.
  • Kollock, (1999) Kollock, P. (1999). Economics of on line cooperation: Gift and public goods in cyberspace. In Smith, M. and Kollock, P., editors, Communities in Cyberspace, pages 220–246. Routledge, London, UK.
  • Krishnamurthy, (2005) Krishnamurthy, S. (2005). Cave or community? An empirical examination of 100 mature open source projects. First Monday.
  • Krishnamurthy, (2006) Krishnamurthy, S. (2006). On the intrinsic and extrinsic motivation of free/libre/open source (FLOSS) developers. Knowledge, Technology & Policy, 18(4):17–39.
  • Kuan, (2001) Kuan, J. W. (2001). Open Source Software as Consumer Integration into Production.
  • Lakhani and von Hippel, (2003) Lakhani, K. R. and von Hippel, E. (2003). How open source software works: “free” user-to-user assistance. Research policy, 32(6):923–943. Place: Amsterdam Publisher: Elsevier B.V.
  • Lakhani et al., (2002) Lakhani, K. R., Wolf, B., Bates, J., and DiBona, C. (2002). Release 0.73 In Cooperation with OSDN. Technical report, The Boston Consulting Group, Cambridge, MA, USA.
  • Lakhani and Wolf, (2003) Lakhani, K. R. and Wolf, R. G. (2003). Why Hackers Do What They Do: Understanding Motivation and Effort in Free/Open Source Software Projects.
  • Langenkamp and Yue, (2022) Langenkamp, M. and Yue, D. N. (2022). How Open Source Machine Learning Software Shapes AI. In Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society, AIES ’22, pages 385–395, New York, NY, USA. Association for Computing Machinery.
  • Lattemann and Stieglitz, (2005) Lattemann, C. and Stieglitz, S. (2005). Framework for Governance in Open Source Communities. In Proceedings of the 38th Annual Hawaii International Conference on System Sciences, pages 192a–192a. ISSN: 1530-1605.
  • Lee and Herstatt, (2015) Lee, V. and Herstatt, C. (2015). How Firms Can Strategically Influence Open Source Communities: The Employment of ‘Men on the Inside’. In Open Source Innovation. Routledge. Num Pages: 35.
  • Lehdonvirta, (2023) Lehdonvirta, V. (2023). Behind AI, a massive infrastructure is changing geopolitics.
  • Lehdonvirta et al., (2019) Lehdonvirta, V., Kässi, O., Hjorth, I., Barnard, H., and Graham, M. (2019). The Global Platform Economy: A New Offshoring Institution Enabling Emerging-Economy Microproviders. Journal of Management, 45(2):567–599. Publisher: SAGE Publications Inc.
  • Lehdonvirta et al., (2023) Lehdonvirta, V., Wu, B., and Hawkins, Z. (2023). Cloud empires’ physical footprint: How trade and security politics shape the global expansion of U.S. and Chinese data centre infrastructures.
  • Lerner, (2002) Lerner, J. (2002). Incentives and Open Source software.
  • Lerner and Tirole, (2001) Lerner, J. and Tirole, J. (2001). The open source movement: Key research questions. European Economic Review, 45(4):819–826.
  • Lerner and Tirole, (2002) Lerner, J. and Tirole, J. (2002). Some Simple Economics of Open Source. The Journal of Industrial Economics, 50(2):197–234.
  • LFAI&Data, (2023) LFAI&Data (2023). LF AI & Data Foundation.
  • Li et al., (2022) Li, H., Ajmani, L., Zhou, M., Vincent, N., Hwang, S., Piccardi, T., Narayan, S., Daniel, S., and Veselovsky, V. (2022). Ethical Tensions, Norms, and Directions in the Extraction of Online Volunteer Work. In Companion Publication of the 2022 Conference on Computer Supported Cooperative Work and Social Computing, CSCW’22 Companion, pages 273–277, New York, NY, USA. Association for Computing Machinery.
  • Li et al., (2024) Li, X., Zhang, Y., Osborne, C., Zhou, M., Jin, Z., and Liu, H. (2024). Systematic Literature Review of Commercial Participation in Open Source Software. ACM Transactions on Software Engineering and Methodology. Just Accepted.
  • Liesenfeld and Dingemanse, (2024) Liesenfeld, A. and Dingemanse, M. (2024). Rethinking open source generative AI: open-washing and the EU AI Act. ACM.
  • Liesenfeld et al., (2023) Liesenfeld, A., Lopez, A., and Dingemanse, M. (2023). Opening up ChatGPT: Tracking openness, transparency, and accountability in instruction-tuned text generators. In Proceedings of the 5th International Conference on Conversational User Interfaces, CUI ’23, pages 1–6, New York, NY, USA. Association for Computing Machinery.
  • Lincoln and Guba, (1985) Lincoln, Y. S. and Guba, E. G. (1985). Naturalistic inquiry. Sage, Beverly Hills ; London.
  • Lindman et al., (2009) Lindman, J., Juutilainen, J.-P., and Rossi, M. (2009). Beyond the Business Model: Incentives for Organizations to Publish Software Source Code. In Boldyreff, C., Crowston, K., Lundell, B., and Wasserman, A. I., editors, Open Source Ecosystems: Diverse Communities Interacting, IFIP Advances in Information and Communication Technology, pages 47–56, Berlin, Heidelberg. Springer.
  • Link and Germonprez, (2016) Link, G. and Germonprez, M. (2016). Understanding open source communities as complex adaptive systems: A case of the R Project community.
  • Linåker et al., (2016) Linåker, J., Rempel, P., Regnell, B., and Mäder, P. (2016). How Firms Adapt and Interact in Open Source Ecosystems: Analyzing Stakeholder Influence and Collaboration Patterns. In Daneva, M. and Pastor, O., editors, Requirements Engineering: Foundation for Software Quality, Lecture Notes in Computer Science, pages 63–81, Cham. Springer International Publishing.
  • Linåker and Runeson, (2020) Linåker, J. and Runeson, P. (2020). Public Sector Platforms going Open: Creating and Growing an Ecosystem with Open Collaborative Development. In Proceedings of the 16th International Symposium on Open Collaboration, OpenSym ’20, pages 1–10, New York, NY, USA. Association for Computing Machinery.
  • Loebbecke and Angehrn, (2003) Loebbecke, C. and Angehrn, A. A. (2003). Open Source Platforms Under Co-opetition: A Comparative Analysis of SourceForge and ’CodeX’ (Xerox) as Two ’Co-opetitive Learning and Knowledge Exchange Networks’ (CoLKENs).
  • Luthiger and Jungwirth, (2007) Luthiger, B. and Jungwirth, C. (2007). Pervasive fun. First Monday.
  • Maffulli, (2023) Maffulli, S. (2023). Meta’s LLaMa 2 license is not Open Source.
  • Markus et al., (2000) Markus, L., Manville, B., and Agres, C. (2000). What Makes a Virtual Organization Work: Lessons From the Open-Source World. MIT Sloan Management Review.
  • Marlow and Dabbish, (2013) Marlow, J. and Dabbish, L. (2013). Activity traces and signals in software developer recruitment and hiring. In Proceedings of the 2013 conference on Computer supported cooperative work, CSCW ’13, pages 145–156, New York, NY, USA. Association for Computing Machinery.
  • Marsan et al., (2012) Marsan, J., Paré, G., and Beaudry, A. (2012). Adoption of open source software in organizations: A socio-cognitive perspective. The Journal of Strategic Information Systems, 21(4):257–273.
  • Meta, (2023) Meta (2023). Meta and Microsoft Introduce the Next Generation of Llama.
  • Mugrage, (2022) Mugrage, K. (2022). The changing economics of open source. MIT Technology Review.
  • Nagle et al., (2020) Nagle, F., Wheeler, D. A., Lifshitz-Assaf, H., Ham, H., and Hoffman, J. L. (2020). Report on the 2020 FOSS Contributor Survey. Technical report, The Linux Foundation, San Francisco, CA, USA.
  • Nguyen Duc et al., (2017) Nguyen Duc, A., Cruzes, D. S., Hanssen, G. K., Snarby, T., and Abrahamsson, P. (2017). Coopetition of Software Firms in Open Source Software Ecosystems. In Ojala, A., Holmström Olsson, H., and Werder, K., editors, Software Business, Lecture Notes in Business Information Processing, pages 146–160, Cham. Springer International Publishing.
  • Nguyen-Duc et al., (2019) Nguyen-Duc, A., Cruzes, D. S., Terje, S., and Abrahamsson, P. (2019). Do software firms collaborate or compete? A model of coopetition in community-initiated OSS projects. e-Informatica Vol. XIII. arXiv:1808.06489 [cs].
  • O’Brien, (2019) O’Brien, D. (2019). The FOSS Contributor Fund: Forming a Community of Adopters.
  • Okoli and Oh, (2007) Okoli, C. and Oh, W. (2007). Investigating recognition-based performance in an open content community: A social capital perspective. Information & Management, 44(3):240–252.
  • O’Mahony and Bechky, (2008) O’Mahony, S. and Bechky, B. A. (2008). Boundary Organizations: Enabling Collaboration among Unexpected Allies. Administrative Science Quarterly, 53(3):422–459. Publisher: SAGE Publications Inc.
  • O’Mahony and Ferraro, (2007) O’Mahony, S. and Ferraro, F. (2007). The Emergence of Governance in an Open Source Community. Academy of Management journal, 50(5):1079–1106. Place: Briarcliff Manor Publisher: Academy of Management.
  • Oreg and Nov, (2008) Oreg, S. and Nov, O. (2008). Exploring motivations for contributing to open source initiatives: The roles of contribution context and personal values. Computers in Human Behavior, 24(5):2055–2073.
  • Osborne, (2024) Osborne, C. (2024). Public-private funding models in open source software development: A case study on scikit-learn. arXiv:2404.06484 [cs].
  • Osborne et al., (2024) Osborne, C., Ding, J., and Kirk, H. R. (2024). The AI community building the future? A quantitative analysis of development activity on Hugging Face Hub. Journal of Computational Social Science.
  • Osterloh et al., (2003) Osterloh, M., Rota, S., and Kuster, B. (2003). Trust and Commerce in Open Source — A Contradiction? pages 129–141.
  • Patel and Ahmad, (2023) Patel, D. and Ahmad, A. (2023). Google "We Have No Moat, And Neither Does OpenAI".
  • Pavlicek, (2000) Pavlicek, R. C. (2000). Keys to Effective Linux Advocacy Within Your Organization.
  • Pfaff and David, (1998) Pfaff, B. and David, K. (1998). Open source software is better for society than proprietary closed source software.
  • Pitt et al., (2006) Pitt, L. F., Watson, R. T., Berthon, P., Wynn, D., and Zinkhan, G. (2006). The Penguin’s Window: Corporate Brands From an Open-Source Perspective. Journal of the Academy of Marketing Science, 34(2):115–127. Publisher: SAGE Publications Inc.
  • PyTorch, (2023) PyTorch (2023). About the PyTorch Foundation.
  • (121) Raymond, E. S. (2001a). The Cathedral and the Bazaar: Musings on Linux and Open Source by an Accidental Revolutionary. O’Reilly Media, Incorporated, Sebastopol.
  • (122) Raymond, E. S. (2001b). Homesteading the Noosphere. In The Cathedral and the Bazaar. O’Reilly Media, Incorporated, United States.
  • Riehle, (2007) Riehle, D. (2007). The Economic Motivation of Open Source Software: Stakeholder Perspectives. Computer, 40(4):25–32. Conference Name: Computer.
  • Roberts et al., (2006) Roberts, J. A., Hann, I.-H., and Slaughter, S. A. (2006). Understanding the Motivations, Participation, and Performance of Open Source Software Developers: A Longitudinal Study of the Apache Projects. Management science, 52(7):984–999. Place: Linthicum Publisher: INFORMS.
  • Runeson and Höst, (2008) Runeson, P. and Höst, M. (2008). Guidelines for conducting and reporting case study research in software engineering. Empirical software engineering : an international journal, 14(2):131–164. Place: Boston Publisher: Springer US.
  • Santos et al., (2013) Santos, C., Kuk, G., Kon, F., and Pearson, J. (2013). The attraction of contributors in free and open source software projects. The Journal of Strategic Information Systems, 22(1):26–45.
  • Schofield and Cooper, (2006) Schofield, A. and Cooper, G. (2006). Participation in Free and Open Source Communities: An Empirical Study of Community Members’ Perceptions. In Open Source Systems: IFIP Working Group 2.13 Foundation on Open Source Software, Como, Italy.
  • Schoonmaker, (2018) Schoonmaker, S. (2018). Free Software, the Internet, and Global Communities of Resistance. Routledge, New York.
  • (129) Seger, E., Dreksler, N., Moulange, R., Dardaman, E., Schuett, J., Wei, K., Winter, C., Arnold, M., O hEigeartaigh, S., Korinek, A., Anderljung, M., Bucknall, B., Chan, A., Stafford, E., Koessler, L., Ovadya, A., Garfinkel, B., Bluemke, E., Aird, M., Levermore, P., Hazell, J., and Gupta, A. (2023a). Open-Sourcing Highly Capable Foundation Models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives. arXiv:2311.09227 [cs].
  • (130) Seger, E., Ovadya, A., Garfinkel, B., Siddarth, D., and Dafoe, A. (2023b). Democratising AI: Multiple Meanings, Goals, and Methods. arXiv:2303.12642 [cs].
  • Shah, (2006) Shah, S. K. (2006). Motivation, Governance, and the Viability of Hybrid Forms in Open Source Software Development. Management science, 52(7):1000–1014. Place: Linthicum Publisher: INFORMS.
  • South Park Commons, (2024) South Park Commons (2024). Mark Zuckerberg on Llama, AI, & Minus One.
  • Spaeth et al., (2008) Spaeth, S., Haefliger, S., von Krogh, G., and Renzl, B. (2008). Communal Resources in Open Source Software Development. Information Research: An International Electronic Journal, 13(1). Publisher: Thomas D ERIC Number: EJ837230.
  • Srnicek, (2022) Srnicek, N. (2022). Data, Compute, Labor. In Graham, M. and Ferrari, F., editors, Digital work in the planetary market.
  • Stallman, (1984) Stallman, R. (1984). The GNU Manifesto - GNU Project - Free Software Foundation.
  • Stewart and Gosain, (2006) Stewart, K. J. and Gosain, S. (2006). The Impact of Ideology on Effectiveness in Open Source Software Development Teams. MIS Quarterly, 30(2):291–314. Publisher: Management Information Systems Research Center, University of Minnesota.
  • Subramanyam and Xia, (2008) Subramanyam, R. and Xia, M. (2008). Free/Libre Open Source Software development in developing and developed countries: A conceptual framework with an exploratory study. Decision Support Systems, 46(1):173–186. Place: Amsterdam Publisher: Elsevier B.V.
  • Takhteyev, (2012) Takhteyev, Y. (2012). Coding places: software practice in a South American city. Acting with technology. MIT Press, Cambridge, Mass.
  • Teixeira, (2014) Teixeira, J. (2014). Understanding Coopetition in the Open-Source Arena: The Cases of WebKit and OpenStack. In Proceedings of The International Symposium on Open Collaboration, OpenSym ’14, pages 1–5, New York, NY, USA. Association for Computing Machinery.
  • Teixeira and Lin, (2014) Teixeira, J. and Lin, T. (2014). Collaboration in the open-source arena: The WebKit case. Proceedings of the 52nd ACM conference on Computers and people research - SIGSIM-CPR ’14, pages 121–129. arXiv: 1401.5996.
  • Teixeira et al., (2016) Teixeira, J., Mian, S. Q., and Hytti, U. (2016). Cooperation among competitors in the open-source arena. In IS in Organizations and Society, page 38, Dublin, Ireland.
  • Teixeira et al., (2015) Teixeira, J., Robles, G., and González-Barahona, J. M. (2015). Lessons learned from applying social network analysis on an industrial Free/Libre/Open Source Software ecosystem. Journal of Internet Services and Applications, 6(1):14.
  • Torvalds, (2001) Torvalds, L. (2001). Just for fun: the story of an accidental revolutionary. Texere, New York ; London.
  • Varoquaux et al., (2024) Varoquaux, G., Luccioni, A. S., and Whittaker, M. (2024). Hype, Sustainability, and the Price of the Bigger-is-Better Paradigm in AI. arXiv:2409.14160 [cs].
  • von Hippel, (2001) von Hippel, E. (2001). Innovation by User Communities: Learning from Open Source Software. Sloan Management Review.
  • von Hippel, (2003) von Hippel, E. (2003). Open Source Projects as Horizontal Innovation Networks - By and for users. IDEAS Working Paper Series from RePEc. Place: St. Louis Publisher: Federal Reserve Bank of St Louis.
  • von Krogh et al., (2012) von Krogh, G., Haefliger, S., Spaeth, S., and Wallin, M. W. (2012). Carrots and Rainbows: Motivation and Social Practice in Open Source Software Development. MIS quarterly, 36(2):649–676. Place: Minneapolis Publisher: Management Information Systems Research Center, University of Minnesota.
  • Wagstrom, (2009) Wagstrom, P. A. (2009). Vertical interaction in open software engineering communities. PhD thesis, Carnegie Insitute of Technology/School of Computer Science, Carnegie Mellon University.
  • West, (2003) West, J. (2003). How open is open enough?: Melding proprietary and open source platform strategies. Research Policy, 32(7):1259–1285.
  • West and O’Mahony, (2005) West, J. and O’Mahony, S. (2005). Contrasting Community Building in Sponsored and Community Founded Open Source Projects. In Proceedings of the 38th Annual Hawaii International Conference on System Sciences, pages 196c–196c, Big Island, HI, USA. IEEE.
  • White et al., (2024) White, M., Haddad, I., Osborne, C., Xiao-Yang, Liu, Abdelmonsef, A., and Varghese, S. (2024). The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency and Usability in AI. arXiv:2403.13784 [cs].
  • Whittaker, (2021) Whittaker, M. (2021). The steep cost of capture. Interactions, 28(6):50–55.
  • Wichmann, (2002) Wichmann, T. (2002). Firms’ Open Source Activities: Motivations and Policy Implications. Technical report, Berlecon Research GmbH, Berlin, Germany.
  • Widder et al., (2023) Widder, D. G., West, S., and Whittaker, M. (2023). Open (For Business): Big Tech, Concentrated Power, and the Political Economy of Open AI.
  • Woods and Guliani, (2005) Woods, D. and Guliani, G. (2005). Open Source for the Enterprise: Managing Risks, Reaping Rewards. O’Reilly Media, Incorporated, Sebastopol.
  • Wu et al., (2007) Wu, C.-G., Gerlach, J. H., and Young, C. E. (2007). An empirical analysis of open source software developers’ motivations and continuance intentions. Information & Management, 44(3):253–262.
  • Xu et al., (2009) Xu, B., Jones, D. R., and Shao, B. (2009). Volunteers’ involvement in online community based software development. Information & Management, 46(3):151–158.
  • Xu et al., (2006) Xu, J., Christley, S., and Madey, G. (2006). 12 - Application of Social Network Analysis to the Study of Open Source Software. In Bitzer, J. and Schröder, P. J. H., editors, The Economics of Open Source Software Development, pages 247–269. Elsevier, Amsterdam.
  • Ye and Kishida, (2003) Ye, Y. and Kishida, K. (2003). Toward an understanding of the motivation of open source software developers. In 25th International Conference on Software Engineering, 2003. Proceedings., pages 419–429. ISSN: 0270-5257.
  • Yin, (2018) Yin, R. K. (2018). Case study research and applications: design and methods. Sage, Los Angeles, USA, sixth edition. edition.
  • Yu et al., (2007) Yu, J., Jiang, Z., and Chan, H. C. (2007). Knowledge contribution in problem solving virtual communities: the mediating role of individual motivations. In Proceedings of the 2007 ACM SIGMIS CPR conference on Computer personnel research: The global information technology workforce, SIGMIS CPR ’07, pages 144–152, New York, NY, USA. Association for Computing Machinery.
  • Yue and Nagle, (2024) Yue, D. and Nagle, F. (2024). Igniting Innovation: Evidence from PyTorch on Technology Control in Open Collaboration.
  • Zeitlyn, (2003) Zeitlyn, D. (2003). Gift economies in the development of open source software: anthropological reflections. Research Policy, 32(7):1287–1291.
  • Zhang et al., (2022) Zhang, Y., Stol, K.-J., Liu, H., and Zhou, M. (2022). Corporate dominance in open source ecosystems: a case study of OpenStack. In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2022, pages 1048–1060, New York, NY, USA. Association for Computing Machinery.
  • Zhang et al., (2018) Zhang, Y., Tan, X., Zhou, M., and Jin, Z. (2018). Companies’ domination in FLOSS development: an empirical study of OpenStack. In Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings, ICSE ’18, pages 440–441, New York, NY, USA. Association for Computing Machinery.
  • Zhang et al., (2021) Zhang, Y., Zhou, M., Mockus, A., and Jin, Z. (2021). Companies’ Participation in OSS Development–An Empirical Study of OpenStack. IEEE Transactions on Software Engineering, 47(10):2242–2259. Conference Name: IEEE Transactions on Software Engineering.
  • Zhang et al., (2020) Zhang, Y., Zhou, M., Stol, K.-J., Wu, J., and Jin, Z. (2020). How Do Companies Collaborate in Open Source Ecosystems? An Empirical Study of OpenStack. In 2020 IEEE/ACM 42nd International Conference on Software Engineering (ICSE), pages 1196–1208. ACM. ISSN: 1558-1225.
  • Zhou and Mockus, (2015) Zhou, M. and Mockus, A. (2015). Who Will Stay in the FLOSS Community? Modeling Participant’s Initial Behavior. IEEE transactions on software engineering, 41(1):82–99. Place: New York Publisher: IEEE.
  • Zhou et al., (2016) Zhou, M., Mockus, A., Ma, X., Zhang, L., and Mei, H. (2016). Inflow and Retention in OSS Communities with Commercial Involvement: A Case Study of Three Hybrid Projects. ACM Transactions on Software Engineering and Methodology, 25(2):13:1–13:29.

Appendix A OSS Donations to LF AI & Data Foundation and PyTorch Foundation

Table 5: OSS Donations to the LF AI & Data and PyTorch Foundations
Project Donor Date TAC Proposal Press Release
Acumos AT&T,
Tech Mahdra
2018-05 https://drive.google.com/file/d/1L6fFhZnFqeR3mwy8Ya/KVoRGzCUjdnyrM https://www.acumos.org/news/2018/11/14/lf-deep-learning-delivers-first-acumos-ai-release-making-it-easier-to-deploy-and-share-artificial-intelligence-models/
Angel Tencent 2018-08 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/18481216/GMT20191121-140452_LF-AI-Foun_1920x1080.mp4 https://www.linuxfoundation.org/press/press-release/lf-deep-learning-adds-two-new-framework-projects-to-expand-community-and-ecosystem
Egeria IBM, ING 2018-08 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/30408948/September%2024%2C%202020_LF%20AI%20TAC%20Deck%202.pptx https://www.linuxfoundation.org/press/press-release/new-ai-data-foundation-combines-industrys-fastest-growing-open-source-developments-in-artificial-intelligence-and-open-data
Elastic
Deep
Learning
Baidu 2018-08 N/A https://www.linuxfoundation.org/press/press-release/lf-deep-learning-adds-two-new-framework-projects-to-expand-community-and-ecosystem
Horovod Uber 2018-12 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/30408797/August%2013%2C%202020_LF%20AI%20TAC%20Deck%20-%20compressed2.pptx https://www.uber.com/en-GB/blog/horovod-deep-learning-foundation/
Pyro Uber 2019-01 https://drive.google.com/file/d/1B0ZkJUKVZoJxsaUkge/02kGKddiZX8DLj/view https://www.uber.com/en-GB/blog/pyro-lf-deep-learning-foundation/
Adlik ZTE 2019-09 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=%2F7733341%2F12091694%2FTAC+recording+08152019.mp4 https://lfaidata.foundation/blog/2019/10/21/lf-ai-welcomes-adlik-as-newest-incubation-project/
ONNX ONNX
Community
2019-11 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/18481160/TAC%2010-24-2019.mp4 https://cloudblogs.microsoft.com/opensource/2019/11/14/onnx-joins-linux-foundation/
Marquez WeWork 2019-12 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/18481417/TAC-12192019.mp4 N/A
sparklyr RStudio 2019-12 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/18481269/TAC-recording-12052019.mp4 https://lfaidata.foundation/blog/2020/01/29/sparklyr-joins-lf-ai-as-its-newest-incubation-project-scaling-data-science-and-machine-learning-workflows-using-apache-spark-and-r/
Milvus Zilliz 2020-01 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/22249639/January%2016%2C%202020_LF%20AI%20TAC%20Deck.pdf https://www.prnewswire.com/news-releases/milvus-the-ai-search-engine-originally-developed-by-zilliz-joins-lf-ai-as-new-incubation-project-301038716.html
OpenDS4All IBM, ODPi, UPenn 2020-02 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/30408948/September%2024%2C%202020_LF%20AI%20TAC%20Deck%202.pptx https://community.ibm.com/community/user/ai-datascience/blogs/ana-echeverri1/2020/02/28/opends4all-is-live
NNStreamer Samsung 2020-03 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/24281106/March%2012%2C%202020_LF%20AI%20TAC%20Deck_v2.pdf https://research.samsung.com/news/LF-AI-Foundation-Announces-NNStreamer-as-Its-Newest-Incubation-Project
ForestFlow DreamWorks
Animation
2020-04 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/24281142/March%2026%2C%202020_LF%20AI%20TAC%20Deck.pdf https://research.dreamworks.com/dreamworks-animation-releases-forestflow-machine-learning-model-server-to-the-open-source-community/
Ludwig Uber 2020-05 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/24281544/Ludwig%20LFAI.pdf https://www.uber.com/blog/introducing-ludwig/
Adversarial Robustness Toolbox IBM 2020-06 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/30409001/October%208%2C%202020_LF%20AI%20TAC%20Deck.pdf https://developer.ibm.com/open/centers/codait/trusted-ai/
AI Explainability 360 IBM 2020-06 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/30409001/October%208%2C%202020_LF%20AI%20TAC%20Deck.pdf https://developer.ibm.com/open/centers/codait/trusted-ai/
AI Fairness 360 IBM 2020-06 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/30409001/October%208%2C%202020_LF%20AI%20TAC%20Deck.pdf https://developer.ibm.com/open/centers/codait/trusted-ai/
Amundsen Lyft 2020-07 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341 https://eng.lyft.com/amundsen-1-year-later-7b60bf28602
DELTA DiDi 2020-09 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/30408797/August%2013%2C%202020_LF%20AI%20TAC%20Deck%20-%20compressed2.pptx https://lfaidata.foundation/blog/2021/06/17/delta-joins-lf-ai-data-as-new-incubation-project/
Feast Gojek 2020-09 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/30408948/September%2024%2C%202020_LF%20AI%20TAC%20Deck%202.pptx https://feast.dev/blog/feast-joins-the-linux-foundation-for-ai-data/
SOAJS Herron Tech 2020-09 https://wiki.lfaidata.foundation/download/attachments/7733341/September%2010%2C%202020_LF%20AI%20TAC%20Deck%20-%20updated.pdf?version=1&modificationDate=1599741898000&api=v2 N/A
DataPractices DataPractices Org 2020-12 https://wiki.lfaidata.foundation/download/attachments/7733341/October%205%2C%202020_LF%20AI%20TAC%20Deck%282%29.pdf?version=1&modificationDate=1604497702000&api=v2 N/A
JanusGraph JanusGraph
Community
2021-01 https://wiki.lfaidata.foundation/download/attachments/7733341/December%203%2C%202020_LF%20AI%20TAC%20Deck.pdf?version=2&modificationDate=1606864208000&api=v2 https://lfaidata.foundation/blog/2021/01/12/janusgraph-joins-lf-ai-data-as-new-incubation-project/
Flyte Lyft 2021-02 https://wiki.lfaidata.foundation/download/attachments/7733341/February%2025%2C%202021_LF%20AI%20TAC%20Deck.pdf?version=1&modificationDate=1614021188000&api=v2 https://eng.lyft.com/flyte-joins-lf-ai-data-48c9b4b60eec
Datashim IBM 2021-03 https://wiki.lfaidata.foundation/download/attachments/7733341/January%2014%2C%202021_LF%20AI%20TAC%20Deck.pdf?version=1&modificationDate=1610631576000&api=v2 https://lfaidata.foundation/blog/2021/03/23/datashim-joins-lf-ai-data-as-new-incubation-project/
RosaeNLG BNP Paribas 2021-03 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/39092353/March%2011%2C%202021_LF%20AI%20TAC%20Deck(2).pdf https://lfaidata.foundation/blog/2021/04/28/rosaenlg-joins-lf-ai-data-as-new-sandbox-project/
Substra OWKIN 2021-03 https://wiki.lfaidata.foundation/download/attachments/7733341/March%2025%2C%202021_LF%20AI%20TAC%20Deck.pdf?version=1&modificationDate=1616590543000&api=v2 https://lfaidata.foundation/blog/2022/11/28/owkin-launches-open-science-push-by-open-sourcing-ai-software-substra-and-releasing-two-open-source-ai-innovations-at-neurips/
Kompute The Institute
for Ethical AI
2021-05 https://wiki.lfaidata.foundation/download/attachments/7733341/May%206%2C%202021_LF%20AI%20TAC%20Deck.pdf?version=2&modificationDate=1620316765000&api=v2 https://lfaidata.foundation/blog/2021/08/26/kompute-joins-lf-ai-data-as-new-sandbox-project/
OpenLineage Datakin, IBM 2021-07 https://wiki.lfaidata.foundation/download/attachments/7733341/December%2015%202022_LF%20AI%20TAC%20Deck.pdf?version=1&modificationDate=1673523471000&api=v2 https://openlineage.io/blog/joining-lfai/
TonY LinkedIn 2018-09 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/43287136/July%2015%2C%202021_LF%20AI%20TAC%20Deck.pdf https://engineering.linkedin.com/blog/2018/09/open-sourcing-tony--native-support-of-tensorflow-on-hadoop
Kedro McKinsey
QuantumBlack
2021-08 https://wiki.lfaidata.foundation/download/attachments/7733341/August%2026%2C%202021_LF%20AI%20TAC%20Deck.pdf?version=1&modificationDate=1630089002000&api=v2 https://medium.com/quantumblack/kedro-joins-the-linux-foundation-to-become-an-open-standard-for-machine-learning-engineering-b0061152ff73
KServe KServe
Community
2021-11 https://wiki.lfaidata.foundation/download/attachments/7733341/October%2018%2C%202021_LF%20AI%20TAC%20Deck.pdf?version=1&modificationDate=1637936838000&api=v2 https://lfaidata.foundation/blog/2022/02/24/kserve-joins-lf-ai-data-as-new-incubation-project/
OpenBytes Graviti 2021-11 https://wiki.lfaidata.foundation/download/attachments/7733341/October%2021%2C%202021_LF%20AI%20TAC%20Deck.pdf?version=1&modificationDate=1635325659000&api=v2 https://www.linuxfoundation.org/press/press-release/linux-foundation-and-graviti-announce-project-openbytes-to-make-open-data-more-accessible-to-all
Artigraph Replica 2022-01 https://wiki.lfaidata.foundation/download/attachments/7733341/January%2027%2C%202022_LF%20AI%20TAC%20Deck.pdf?version=1&modificationDate=1643715756000&api=v2 https://lfaidata.foundation/uncategorized/2022/04/13/lf-ai-data-announces-artigraph-as-new-sandbox-project/
1chipML Ericsson 2022-04 https://wiki.lfaidata.foundation/download/attachments/7733341/April%207%2C%202022_LF%20AI%20TAC%20Deck%20%281%29.pdf?version=1&modificationDate=1650500517000&api=v2 https://lfaidata.foundation/blog/2022/07/21/lf-ai-data-announces-three-new-sandbox-projects/
BeyondML Squared AI 2022-06 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/61964452/June%2016%2C%202022_LF%20AI%20TAC%20Deck.pdf https://lfaidata.foundation/blog/2022/07/21/lf-ai-data-announces-three-new-sandbox-projects/
FlagAI BAAI 2022-06 https://wiki.lfaidata.foundation/download/attachments/7733341/June%2030%2C%202022_LF%20AI%20TAC%20Deck.pdf?version=1&modificationDate=1657325720000&api=v2 N/A
FedLCM VMWare 2022-08 https://wiki.lfaidata.foundation/download/attachments/7733341/July%2028%2C%202022_LF%20AI%20TAC%20Deck%20%281%29.pdf?version=1&modificationDate=1660155848000&api=v2 https://blogs.vmware.com/opensource/2022/10/27/open-source-project-fedlcm-to-the-lf-ai-data/
FATE LinkedIn 2022-08 https://wiki.lfaidata.foundation/download/attachments/7733341/August%2025%2C%202022_LF%20AI%20TAC%20Deck.pdf?version=1&modificationDate=1662390660000&api=v2 https://cloudblogs.microsoft.com/opensource/2022/09/12/feathr-feature-store-joins-lf-ai-data-foundation/
OpenDataology OpenDataology Community 2022-08 https://wiki.lfaidata.foundation/download/attachments/7733341/August%2011%2C%202022_LF%20AI%20TAC%20Deck.pdf?version=1&modificationDate=1661373254000&api=v2 N/A
PyTorch Meta 2022-09 N/A https://pytorch.org/blog/PyTorchfoundation/
Elyra IBM 2022-10 https://wiki.lfaidata.foundation/pages/viewpage.action?pageId=7733341&preview=/7733341/39092353/March%2011%2C%202021_LF%20AI%20TAC%20Deck(2).pdf https://developer.ibm.com/blogs/open-source-elyra-ai-toolkit-simplifies-data-model-development/