Zum Hauptinhalt springen

Showing 1–50 of 68 results for author: Sarma, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04626  [pdf, other

    cs.LG

    Adaptive Interface-PINNs (AdaI-PINNs): An Efficient Physics-informed Neural Networks Framework for Interface Problems

    Authors: Sumanta Roy, Chandrasekhar Annavarapu, Pratanu Roy, Antareep Kumar Sarma

    Abstract: We present an efficient physics-informed neural networks (PINNs) framework, termed Adaptive Interface-PINNs (AdaI-PINNs), to improve the modeling of interface problems with discontinuous coefficients and/or interfacial jumps. This framework is an enhanced version of its predecessor, Interface PINNs or I-PINNs (Sarma et al.; https://dx.doi.org/10.2139/ssrn.4766623), which involves domain decomposit… ▽ More

    Submitted 10 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: 17 pages, 8 figures, 6 tables

  2. arXiv:2312.11719  [pdf, other

    cs.SE cs.HC

    How Far Are We? The Triumphs and Trials of Generative AI in Learning Software Engineering

    Authors: Rudrajit Choudhuri, Dylan Liu, Igor Steinmacher, Marco Gerosa, Anita Sarma

    Abstract: Conversational Generative AI (convo-genAI) is revolutionizing Software Engineering (SE) as engineers and academics embrace this technology in their work. However, there is a gap in understanding the current potential and pitfalls of this technology, specifically in supporting students in SE tasks. In this work, we evaluate through a between-subjects study (N=22) the effectiveness of ChatGPT, a con… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 13 pages, ICSE 2024

    Journal ref: ACM/IEEE 46th International Conference on Software Engineering (ICSE 2024)

  3. arXiv:2312.11431  [pdf, other

    cs.HC cs.SE

    Make It Make Sense! Understanding and Facilitating Sensemaking in Computational Notebooks

    Authors: Souti Chattopadhyay, Zixuan Feng, Emily Arteaga, Audrey Au, Gonzalo Ramos, Titus Barik, Anita Sarma

    Abstract: Reusing and making sense of other scientists' computational notebooks. However, making sense of existing notebooks is a struggle, as these reference notebooks are often exploratory, have messy structures, include multiple alternatives, and have little explanation. To help mitigate these issues, we developed a catalog of cognitive tasks associated with the sensemaking process. Utilizing this catalo… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 26 Pages

  4. arXiv:2312.08543  [pdf, other

    cs.SE

    Unveiling Diversity: Empowering OSS Project Leaders with Community Diversity and Turnover Dashboards

    Authors: Mariam Guizani, Zixuan Feng, Emily Judith Arteaga, Luis Cañas-Díaz, Alexander Serebrenik, Anita Sarma

    Abstract: Managing open-source software (OSS) projects requires managing communities of contributors. In particular, it is essential for project leaders to understand their community's diversity and turnover. We present CommunityTapestry, a dynamic real-time community dashboard, which presents key diversity and turnover signals that we identified from the literature and through participatory design sessions… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  5. arXiv:2311.11081  [pdf, other

    cs.SE

    Can AI Serve as a Substitute for Human Subjects in Software Engineering Research?

    Authors: Marco A. Gerosa, Bianca Trinkenreich, Igor Steinmacher, Anita Sarma

    Abstract: Research within sociotechnical domains, such as Software Engineering, fundamentally requires a thorough consideration of the human perspective. However, traditional qualitative data collection methods suffer from challenges related to scale, labor intensity, and the increasing difficulty of participant recruitment. This vision paper proposes a novel approach to qualitative data collection in softw… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  6. arXiv:2304.04873  [pdf

    cs.HC

    SocioEconomicMag Meets a Platform for SES-Diverse College Students: A Case Study

    Authors: Puja Agarwal, Divya Prem, Christopher Bogart, Abrar Fallatah, Aileen Abril Castro-Guzman, Pannapat Chanpaisaeng, Stella Doehring, Margaret Burnett, Anita Sarma

    Abstract: Emerging research shows that individual differences in how people use technology sometimes cluster by socioeconomic status (SES) and that when technology is not socioeconomically inclusive, low-SES individuals may abandon it. To understand how to improve technology's SES-inclusivity, we present a multi-phase case study on SocioEconomicMag (SESMag), an emerging inspection method for socio+economic… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: 26 pages, 7 figures

  7. Tag that issue: Applying API-domain labels in issue tracking systems

    Authors: Fabio Santos, Joseph Vargovich, Bianca Trinkenreich, Italo Santos, Jacob Penney, Ricardo Britto, João Felipe Pimentel, Igor Wiese, Igor Steinmacher, Anita Sarma, Marco A. Gerosa

    Abstract: Labeling issues with the skills required to complete them can help contributors to choose tasks in Open Source Software projects. However, manually labeling issues is time-consuming and error-prone, and current automated approaches are mostly limited to classifying issues as bugs/non-bugs. We investigate the feasibility and relevance of automatically labeling issues with what we call "API-domains,… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: arXiv admin note: text overlap with arXiv:2103.12653

    ACM Class: D.2.0; I.2.7; K.6.3

    Journal ref: Empirical Software Engineering (EMSE), 2023

  8. The State of Diversity and Inclusion in Apache: A Pulse Check

    Authors: Zixuan Feng, Mariam Guizani, Marco A. Gerosa, Anita Sarma

    Abstract: Diversity and inclusion in open source software (OSS) is a multifaceted concept that arises from differences in contributors' gender, seniority, language, region, and other characteristics. D&I has received growing attention in OSS ecosystems and projects, and various programs have been implemented to foster contributor diversity. However, we do not yet know how the state of D&I is evolving. By un… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: 11 pages, 1 figure

    Journal ref: 16th International Conference on Cooperative and Human Aspects of Software Engineering (CHASE 2023)

  9. arXiv:2303.08266  [pdf, other

    cs.CY

    Rules of Engagement: Why and How Companies Participate in OSS

    Authors: Mariam Guizani, Aileen Abril Castro-Guzman, Anita Sarma, Igor Steinmacher

    Abstract: Company engagement in open source (OSS) is now the new norm. From large technology companies to startups, companies are participating in the OSS ecosystem by open-sourcing their technology, sponsoring projects through funding or paid developer time. However, our understanding of the OSS ecosystem is rooted in the 'old world' model where individual contributors sustain OSS projects. In this work, w… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: 13 pages, 2 figures, ICSE 2023

  10. Designing for Cognitive Diversity: Improving the GitHub Experience for Newcomers

    Authors: Italo Santos, João Felipe Pimentel, Igor Wiese, Igor Steinmacher, Anita Sarma, Marco A. Gerosa

    Abstract: Social coding platforms such as GitHub have become defacto environments for collaborative programming and open source. When these platforms do not support specific cognitive styles, they create barriers to programming for some populations. Research shows that the cognitive styles typically favored by women are often unsupported, creating barriers to entry for woman newcomers. In this paper, we use… ▽ More

    Submitted 10 February, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Journal ref: 45th IEEE/ACM International Conference on Software Engineering, Software Engineering in Society (ICSE SEIS 2023)

  11. A Model for Understanding and Reducing Developer Burnout

    Authors: Bianca Trinkenreich, Klaas-Jan Stol, Igor Steinmacher, Marco Gerosa, Anita Sarma, Marcelo Lara, Michael Feathers, Nicholas Ross, Kevin Bishop

    Abstract: Job burnout is a type of work-related stress associated with a state of physical or emotional exhaustion that also involves a sense of reduced accomplishment and loss of personal identity. Burnt out can affect one's physical and mental health and has become a leading industry concern and can result in high workforce turnover. Through an empirical study at Globant, a large multi-national company, w… ▽ More

    Submitted 24 January, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

    Journal ref: 45th IEEE/ACM International Conference on Software Engineering, Software Engineering in Practice (ICSE SEIP 2023)

  12. Do I Belong? Modeling Sense of Virtual Community Among Linux Kernel Contributors

    Authors: Bianca Trinkenreich, Klaas-Jan Stol, Anita Sarma, Daniel M. German, Marco A. Gerosa, Igor Steinmacher

    Abstract: The sense of belonging to a community is a basic human need that impacts an individuals behavior, long-term engagement, and job satisfaction, as revealed by research in disciplines such as psychology, healthcare, and education. Despite much research on how to retain developers in Open Source Software projects and other virtual, peer-production communities, there is a paucity of research investigat… ▽ More

    Submitted 22 February, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

    Journal ref: 45th IEEE/ACM International Conference on Software Engineering (ICSE 2023)

  13. arXiv:2212.14694  [pdf, other

    physics.soc-ph cs.LG cs.SI

    Machine Learning as an Accurate Predictor for Percolation Threshold of Diverse Networks

    Authors: Siddharth Patwardhan, Utso Majumder, Aditya Das Sarma, Mayukha Pal, Divyanshi Dwivedi, Prasanta K. Panigrahi

    Abstract: The percolation threshold is an important measure to determine the inherent rigidity of large networks. Predictors of the percolation threshold for large networks are computationally intense to run, hence it is a necessity to develop predictors of the percolation threshold of networks, that do not rely on numerical simulations. We demonstrate the efficacy of five machine learning-based regression… ▽ More

    Submitted 25 March, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  14. arXiv:2203.02585  [pdf, other

    cs.NI

    NFSlicer: Data Movement Optimization for Shallow Network Functions

    Authors: Anirudh Sarma, Hamed Seyedroudbari, Harshit Gupta, Umakishore Ramachandran, Alexandros Daglis

    Abstract: Network Function (NF) deployments on commodity servers have become ubiquitous in datacenters and enterprise settings. Many commonly used NFs such as firewalls, load balancers and NATs are shallow - i.e., they only examine the packet's header, despite the entire packet being transferred on and off the server. As a result, the gap between moved and inspected data when handling large packets exceeds… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: 13 pages, 16 figures

  15. arXiv:2203.01764  [pdf, other

    quant-ph cs.CV eess.IV

    Random Quantum Neural Networks (RQNN) for Noisy Image Recognition

    Authors: Debanjan Konar, Erol Gelenbe, Soham Bhandary, Aditya Das Sarma, Attila Cangi

    Abstract: Classical Random Neural Networks (RNNs) have demonstrated effective applications in decision making, signal processing, and image recognition tasks. However, their implementation has been limited to deterministic digital systems that output probability distributions in lieu of stochastic behaviors of random spiking signals. We introduce the novel class of supervised Random Quantum Neural Networks… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

  16. arXiv:2202.13303  [pdf, other

    cs.SE cs.CY cs.HC

    How to Debug Inclusivity Bugs? A Debugging Process with Information Architecture

    Authors: Mariam Guizani, Igor Steinmacher, Jillian Emard, Abrar Fallatah, Margaret Burnett, Anita Sarma

    Abstract: Although some previous research has found ways to find inclusivity bugs (biases in software that introduce inequities), little attention has been paid to how to go about fixing such bugs. Without a process to move from finding to fixing, acting upon such findings is an ad-hoc activity, at the mercy of the skills of each individual developer. To address this gap, we created Why/Where/Fix, a systema… ▽ More

    Submitted 27 February, 2022; originally announced February 2022.

    Comments: 12 pages, Accepted at ICSE SEIS 2022

  17. Perceptions of the State of D&I and D&I Initiative in the ASF

    Authors: Mariam Guizani, Bianca Trinkenreich, Aileen Abril Castro-Guzman, Igor Steinmacher, Marco Gerosa, Anita Sarma

    Abstract: Open Source Software (OSS) Foundations and projects are investing in creating Diversity and Inclusion (D&I) initiatives. However, little is known about contributors' perceptions about the usefulness and success of such initiatives. We aim to close this gap by investigating how contributors perceive the state of D&I in their community. In collaboration with the Apache Software Foundation (ASF), we… ▽ More

    Submitted 26 February, 2022; originally announced February 2022.

    Comments: 13 pages, Accepted at ICSE SEIS 2022

    Journal ref: ACM/IEEE 44rd International Conference on Software Engineering, SEIS - Software Engineering in Society (ICSE SEIS 2022)

  18. arXiv:2202.11300  [pdf, other

    cs.SE

    Implicit Mentoring: The Unacknowledged Developer Efforts in Open Source

    Authors: Zixuan Feng, Amreeta Chatterjee, Anita Sarma, Iftekhar Ahmed

    Abstract: Mentoring is traditionally viewed as a dyadic, top-down apprenticeship. This perspective, however, overlooks other forms of informal mentoring taking place in everyday activities in which developers invest time and effort, but remain unacknowledged. Here, we investigate the different flavors of mentoring in Open Source Software (OSS) to define and identify implicit mentoring. We first define impli… ▽ More

    Submitted 3 February, 2023; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: 12 pages, 3 figures

  19. arXiv:2202.07740  [pdf, other

    cs.SE cs.CY cs.HC

    Attracting and Retaining OSS Contributors with a Maintainer Dashboard

    Authors: Mariam Guizani, Thomas Zimmermann, Anita Sarma, Denae Ford

    Abstract: Tools and artifacts produced by open source software (OSS) have been woven into the foundation of the technology industry. To keep this foundation intact, the open source community needs to actively invest in sustainable approaches to bring in new contributors and nurture existing ones. We take a first step at this by collaboratively designing a maintainer dashboard that provides recommendations o… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: 5 pages, Accepted at ICSE SEIS 2022

  20. arXiv:2201.10643  [pdf, other

    cs.HC cs.AI

    Intersectionality Goes Analytical: Taming Combinatorial Explosion Through Type Abstraction

    Authors: Margaret Burnett, Martin Erwig, Abrar Fallatah, Christopher Bogart, Anita Sarma

    Abstract: HCI researchers' and practitioners' awareness of intersectionality has been expanding, producing knowledge, recommendations, and prototypes for supporting intersectional populations. However, doing intersectional HCI work is uniquely expensive: it leads to a combinatorial explosion of empirical work (expense 1), and little of the work on one intersectional population can be leveraged to serve anot… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

  21. arXiv:2112.14206  [pdf, ps, other

    cs.IT math.AG

    A survey on product codes and 2-D codes

    Authors: Amajit Sarma

    Abstract: One of the simplest way of combining codes to form new codes is to take their direct product. Direct product of cyclic codes and various generalizations have been studied for many years. In this note, we survey cyclic product codes, direct product of various generalizations of cyclic codes and their properties.

    Submitted 31 December, 2021; v1 submitted 28 December, 2021; originally announced December 2021.

    Comments: 40 pages, typos corrected, references added. arXiv admin note: text overlap with arXiv:1512.06690, arXiv:1505.02238, arXiv:1301.6231 by other authors

  22. arXiv:2109.07710  [pdf, other

    cs.LG cs.CV

    Exploiting Activation based Gradient Output Sparsity to Accelerate Backpropagation in CNNs

    Authors: Anup Sarma, Sonali Singh, Huaipan Jiang, Ashutosh Pattnaik, Asit K Mishra, Vijaykrishnan Narayanan, Mahmut T Kandemir, Chita R Das

    Abstract: Machine/deep-learning (ML/DL) based techniques are emerging as a driving force behind many cutting-edge technologies, achieving high accuracy on computer vision workloads such as image classification and object detection. However, training these models involving large parameters is both time-consuming and energy-hogging. In this regard, several prior works have advocated for sparsity to speed up t… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

  23. arXiv:2107.08141  [pdf, other

    cs.HC

    An Automated Approach to Reasoning About Task-Oriented Insights in Responsive Visualization

    Authors: Hyeok Kim, Ryan Rossi, Abhraneel Sarma, Dominik Moritz, Jessica Hullman

    Abstract: Authors often transform a large screen visualization for smaller displays through rescaling, aggregation and other techniques when creating visualizations for both desktop and mobile devices (i.e., responsive visualization). However, transformations can alter relationships or patterns implied by the large screen view, requiring authors to reason carefully about what information to preserve while a… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 9 pages, 11 figures, 2 tables. Accepted at IEEE VIS 2021

  24. arXiv:2106.12089  [pdf, other

    cs.LG cs.CL cs.PF

    Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for Efficient Training

    Authors: Anup Sarma, Sonali Singh, Huaipan Jiang, Rui Zhang, Mahmut T Kandemir, Chita R Das

    Abstract: Recurrent Neural Networks (RNNs), more specifically their Long Short-Term Memory (LSTM) variants, have been widely used as a deep learning tool for tackling sequence-based learning tasks in text and speech. Training of such LSTM applications is computationally intensive due to the recurrent nature of hidden state computation that repeats for each time step. While sparsity in Deep Neural Nets has b… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

  25. Pots of Gold at the End of the Rainbow: What is Success for Open Source Contributors?

    Authors: Bianca Trinkenreich, Mariam Guizani, Igor Wiese, Tayana Conte, Marco Gerosa, Anita Sarma, Igor Steinmacher

    Abstract: Success in Open Source Software (OSS) is often perceived as an exclusively code-centric endeavor. This perception can exclude a variety of individuals with a diverse set of skills and backgrounds, in turn helping create the current diversity & inclusion imbalance in OSS. Because people's perspectives of success affect their personal, professional, and life choices, to be able to support a diverse… ▽ More

    Submitted 20 July, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Journal ref: IEEE Transactions on Software Engineering (TSE), 2021

  26. Women's Participation in Open Source Software: A Survey of the Literature

    Authors: Bianca Trinkenreich, Igor Wiese, Anita Sarma, Marco Gerosa, Igor Steinmacher

    Abstract: Participation of women in Open Source Software (OSS) is very unbalanced, despite various efforts to improve diversity. This is concerning not only because women do not get the chance of career and skill developments afforded by OSS, but also because OSS projects suffer from a lack of diversity of thoughts because of a lack of diversity in their projects. Studies that characterize women's participa… ▽ More

    Submitted 5 January, 2022; v1 submitted 18 May, 2021; originally announced May 2021.

    Journal ref: ACM Transactions on Software Engineering and Methodology (TOSEM 2022)

  27. Can I Solve It? Identifying APIs Required to Complete OSS Task

    Authors: Fabio Santos, Igor Wiese, Bianca Trinkenreich, Igor Steinmacher, Anita Sarma, Marco Gerosa

    Abstract: Open Source Software projects add labels to open issues to help contributors choose tasks. However, manually labeling issues is time-consuming and error-prone. Current automatic approaches for creating labels are mostly limited to classifying issues as a bug/non-bug. In this paper, we investigate the feasibility and relevance of labeling issues with the domain of the APIs required to complete the… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

    Journal ref: Mining Software Repositories Conference (MSR 2021), May 2021

  28. The Shifting Sands of Motivation: Revisiting What Drives Contributors in Open Source

    Authors: Marco Gerosa, Igor Wiese, Bianca Trinkenreich, Georg Link, Gregorio Robles, Christoph Treude, Igor Steinmacher, Anita Sarma

    Abstract: Open Source Software (OSS) has changed drastically over the last decade, with OSS projects now producing a large ecosystem of popular products, involving industry participation, and providing professional career opportunities. But our field's understanding of what motivates people to contribute to OSS is still fundamentally grounded in studies from the early 2000s. With the changed landscape of OS… ▽ More

    Submitted 29 January, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: Paper accept at 43rd ACM/IEEE International Conference on Software Engineering, Madrid, Spain. June 2021

    Journal ref: ACM/IEEE 43rd International Conference on Software Engineering (ICSE 2021)

  29. arXiv:2012.12518  [pdf, ps, other

    cs.CR

    If This Context Then That Concern: Exploring users' concerns with IFTTT applets

    Authors: Mahsa Saeidi, McKenzie Calvert, Audrey W. Au, Anita Sarma, Rakesh B. Bobba

    Abstract: End users are increasingly using trigger-action platforms like, If-This-Then-That (IFTTT) to create applets to connect smart home devices and services. However, there are inherent risks in using such applets -- even non-malicious ones -- as sensitive information may leak through their use in certain contexts (e.g., where the device is located, who can observe the resultant action). This work aims… ▽ More

    Submitted 23 December, 2020; originally announced December 2020.

  30. arXiv:2006.03844  [pdf, other

    cs.SE

    Replacements and Replaceables: Making the Case for Code Variants

    Authors: Venkatesh Vinayakarao, Devika Sondhi, Sumit Keswani, Rahul Purandare, Anita Sarma

    Abstract: There are often multiple ways to implement the same requirement in source code. Different implementation choices can result in code snippets that are similar, and have been defined in multiple ways: code clones, examples, simions and variants. Currently, there is a lack of a consistent and unambiguous definition of such types of code snippets. Here we present a characterization study of code varia… ▽ More

    Submitted 14 June, 2020; v1 submitted 6 June, 2020; originally announced June 2020.

    ACM Class: D.2.m; D.2.13

  31. arXiv:1909.04226  [pdf, other

    quant-ph cs.LG

    Quantum Unsupervised and Supervised Learning on Superconducting Processors

    Authors: Abhijat Sarma, Rupak Chatterjee, Kaitlin Gili, Ting Yu

    Abstract: Machine learning algorithms perform well on identifying patterns in many different datasets due to their versatility. However, as one increases the size of the dataset, the computation time for training and using these statistical models grows quickly. Quantum computing offers a new paradigm which may have the ability to overcome these computational difficulties. Here, we propose a quantum analogu… ▽ More

    Submitted 6 January, 2022; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: Updated to the published version

    Journal ref: Quantum Information and Computation 20 (7&8), 541-552 (2020)

  32. arXiv:1905.10361  [pdf

    cs.SE cs.CY

    Engineering Gender-Inclusivity into Software: Tales from the Trenches

    Authors: Claudia Hilderbrand, Christopher Perdriau, Lara Letaw, Jillian Emard, Zoe Steine-Hanson, Margaret Burnett, Anita Sarma

    Abstract: Although the need for gender-inclusivity in software itself is gaining attention among both SE researchers and SE practitioners, and methods have been published to help, little has been reported on how to make such methods work in real-world settings. For example, how do busy software practitioners use such methods in low-cost ways? How do they endeavor to maximize benefits from using them? How do… ▽ More

    Submitted 27 May, 2019; originally announced May 2019.

  33. arXiv:1905.02813  [pdf

    cs.HC

    Fixing Inclusivity Bugs for Information Processing Styles and Learning Styles

    Authors: Zoe Steine-Hanson, Claudia Hilderbrand, Lara Letaw, Jillian Emard, Christopher Perdriau, Christopher Mendez, Margaret Burnett, Anita Sarma

    Abstract: Most software systems today do not support cognitive diversity. Further, because of differences in problem-solving styles that cluster by gender, software that poorly supports cognitive diversity can also embed gender biases. To help software professionals fix gender bias "bugs" related to people's problem-solving styles for information processing and learning of new software we collected inclusiv… ▽ More

    Submitted 7 May, 2019; originally announced May 2019.

  34. arXiv:1905.02812  [pdf

    cs.HC

    From GenderMag to InclusiveMag: An Inclusive Design Meta-Method

    Authors: Christopher Mendez, Lara Letaw, Margaret Burnett, Simone Stumpf, Anita Sarma, Claudia Hilderbrand

    Abstract: How can software practitioners assess whether their software supports diverse users? Although there are empirical processes that can be used to find "inclusivity bugs" piecemeal, what is often needed is a systematic inspection method to assess soft-ware's support for diverse populations. To help fill this gap, this paper introduces InclusiveMag, a generalization of GenderMag that can be used to ge… ▽ More

    Submitted 7 May, 2019; originally announced May 2019.

  35. arXiv:1803.00263  [pdf, other

    cs.SI physics.soc-ph

    EvoCut : A new Generalization of Albert-Barabási Model for Evolution of Complex Networks

    Authors: Shailesh Kumar Jaiswal, Nabajyoti Medhi, Manjish Pal, Mridul Sahu, Prashant Sahu, Amal Dev Sarma

    Abstract: With the evolution of social networks, the network structure shows dynamic nature in which nodes and edges appear as well as disappear for various reasons. The role of a node in the network is presented as the number of interactions it has with the other nodes. For this purpose a network is modeled as a graph where nodes represent network members and edges represent a relationship among them. Seve… ▽ More

    Submitted 1 March, 2018; originally announced March 2018.

  36. arXiv:1701.06207  [pdf, other

    cs.HC

    Understanding Workers, Developing Effective Tasks, and Enhancing Marketplace Dynamics: A Study of a Large Crowdsourcing Marketplace

    Authors: Ayush Jain, Akash Das Sarma, Aditya Parameswaran, Jennifer Widom

    Abstract: We conduct an experimental analysis of a dataset comprising over 27 million microtasks performed by over 70,000 workers issued to a large crowdsourcing marketplace between 2012-2016. Using this data---never before analyzed in an academic context---we shed light on three crucial aspects of crowdsourcing: (1) Task design --- helping requesters understand what constitutes an effective task, and how t… ▽ More

    Submitted 22 January, 2017; originally announced January 2017.

  37. arXiv:1610.05377  [pdf, other

    cs.HC

    Optimizing Open-Ended Crowdsourcing: The Next Frontier in Crowdsourced Data Management

    Authors: Aditya Parameswaran, Akash Das Sarma, Vipul Venkataraman

    Abstract: Crowdsourcing is the primary means to generate training data at scale, and when combined with sophisticated machine learning algorithms, crowdsourcing is an enabler for a variety of emergent automated applications impacting all spheres of our lives. This paper surveys the emerging field of formally reasoning about and optimizing open-ended crowdsourcing, a popular and crucially important, but seve… ▽ More

    Submitted 17 October, 2016; originally announced October 2016.

  38. arXiv:1503.00306  [pdf, other

    cs.DB

    Fusing Data with Correlations

    Authors: Ravali Pochampally, Anish Das Sarma, Xin Luna Dong, Alexandra Meliou, Divesh Srivastava

    Abstract: Many applications rely on Web data and extraction systems to accomplish knowledge-driven tasks. Web information is not curated, so many sources provide inaccurate, or conflicting information. Moreover, extraction systems introduce additional noise to the data. We wish to automatically distinguish correct data and erroneous data for creating a cleaner set of integrated data. Previous work has shown… ▽ More

    Submitted 1 March, 2015; originally announced March 2015.

    Comments: Sigmod'2014

  39. arXiv:1502.07710  [pdf, other

    cs.OH

    Globally Optimal Crowdsourcing Quality Management

    Authors: Akash Das Sarma, Aditya Parameswaran, Jennifer Widom

    Abstract: We study crowdsourcing quality management, that is, given worker responses to a set of tasks, our goal is to jointly estimate the true answers for the tasks, as well as the quality of the workers. Prior work on this problem relies primarily on applying Expectation-Maximization (EM) on the underlying maximum likelihood problem to estimate true answers as well as worker quality. Unfortunately, EM on… ▽ More

    Submitted 1 March, 2015; v1 submitted 26 February, 2015; originally announced February 2015.

  40. arXiv:1404.5351  [pdf, other

    cs.CV

    Fast Approximate Matching of Cell-Phone Videos for Robust Background Subtraction

    Authors: Raffay Hamid, Atish Das Sarma, Dennis DeCoste, Neel Sundaresan

    Abstract: We identify a novel instance of the background subtraction problem that focuses on extracting near-field foreground objects captured using handheld cameras. Given two user-generated videos of a scene, one with and the other without the foreground object(s), our goal is to efficiently generate an output video with only the foreground object(s) present in it. We cast this challenge as a spatio-tempo… ▽ More

    Submitted 21 April, 2014; originally announced April 2014.

  41. arXiv:1310.5407  [pdf, other

    cs.DC cs.DS

    Distributed Computation of Sparse Cuts

    Authors: Atish Das Sarma, Anisur Rahaman Molla, Gopal Pandurangan

    Abstract: Finding sparse cuts is an important tool in analyzing large-scale distributed networks such as the Internet and Peer-to-Peer networks, as well as large-scale graphs such as the web graph, online social communities, and VLSI circuits. In distributed communication networks, they are useful for topology maintenance and for designing better search and routing algorithms. In this paper, we focus on d… ▽ More

    Submitted 20 October, 2013; originally announced October 2013.

    Comments: 19 pages, 1 figure

  42. arXiv:1302.4544  [pdf, ps, other

    cs.DC cs.DS

    Distributed Random Walks

    Authors: Atish Das Sarma, Danupon Nanongkai, Gopal Pandurangan, Prasad Tetali

    Abstract: Performing random walks in networks is a fundamental primitive that has found applications in many areas of computer science, including distributed computing. In this paper, we focus on the problem of sampling random walks efficiently in a distributed network and its applications. Given bandwidth constraints, the goal is to minimize the number of rounds required to obtain random walk samples. Al… ▽ More

    Submitted 19 February, 2013; originally announced February 2013.

    Comments: Preprint of an article to appear in Journal of the ACM in February 2013. The official journal version has several gramatical corrections. Preliminary versions of this paper appeared in PODC 2009 and PODC 2010. arXiv admin note: substantial text overlap with arXiv:0911.3195, arXiv:1205.5525

    ACM Class: F.2.2; G.2.2

  43. Fast Distributed PageRank Computation

    Authors: Atish Das Sarma, Anisur Rahaman Molla, Gopal Pandurangan, Eli Upfal

    Abstract: Over the last decade, PageRank has gained importance in a wide range of applications and domains, ever since it first proved to be effective in determining node importance in large graphs (and was a pioneering idea behind Google's search engine). In distributed computing alone, PageRank vector, or more generally random walk based quantities have been used for several different applications ranging… ▽ More

    Submitted 25 November, 2015; v1 submitted 15 August, 2012; originally announced August 2012.

    Comments: 14 pages

    Journal ref: Theoretical Computer Science, Volume 561, Pages 113-121, 2015

  44. arXiv:1208.1454  [pdf, ps, other

    cs.DC cs.DS

    Dense Subgraphs on Dynamic Networks

    Authors: Atish Das Sarma, Ashwin Lall, Danupon Nanongkai, Amitabh Trehan

    Abstract: In distributed networks, it is often useful for the nodes to be aware of dense subgraphs, e.g., such a dense subgraph could reveal dense subtructures in otherwise sparse graphs (e.g. the World Wide Web or social networks); these might reveal community clusters or dense regions for possibly maintaining good communication infrastructure. In this work, we address the problem of self-awareness of node… ▽ More

    Submitted 7 August, 2012; originally announced August 2012.

    Comments: To appear in the 26th International Symposium on Distributed Computing (DISC 2012)

    ACM Class: C.2.4; F.0; F.2.2; G.2.2

  45. arXiv:1206.4377  [pdf, other

    cs.DC cs.DS

    Upper and Lower Bounds on the Cost of a Map-Reduce Computation

    Authors: Foto N. Afrati, Anish Das Sarma, Semih Salihoglu, Jeffrey D. Ullman

    Abstract: In this paper we study the tradeoff between parallelism and communication cost in a map-reduce computation. For any problem that is not "embarrassingly parallel," the finer we partition the work of the reducers so that more parallelism can be extracted, the greater will be the total communication between mappers and reducers. We introduce a model of problems that can be solved in a single round of… ▽ More

    Submitted 19 June, 2012; originally announced June 2012.

    Comments: 14 pages

  46. arXiv:1205.5525  [pdf, other

    cs.DC cs.DS

    Fast Distributed Computation in Dynamic Networks via Random Walks

    Authors: Atish Das Sarma, Anisur Rahaman Molla, Gopal Pandurangan

    Abstract: The paper investigates efficient distributed computation in dynamic networks in which the network topology changes (arbitrarily) from round to round. Our first contribution is a rigorous framework for design and analysis of distributed random walk algorithms in dynamic networks. We then develop a fast distributed random walk based algorithm that runs in $\tilde{O}(\sqrt{τΦ})$ rounds (with high p… ▽ More

    Submitted 24 May, 2012; originally announced May 2012.

  47. arXiv:1204.4346  [pdf, ps, other

    cs.DL cs.CL cs.SI physics.soc-ph

    Your Two Weeks of Fame and Your Grandmother's

    Authors: James Cook, Atish Das Sarma, Alex Fabrikant, Andrew Tomkins

    Abstract: Did celebrity last longer in 1929, 1992 or 2009? We investigate the phenomenon of fame by mining a collection of news articles that spans the twentieth century, and also perform a side study on a collection of blog posts from the last 10 years. By analyzing mentions of personal names, we measure each person's time in the spotlight, using two simple metrics that evaluate, roughly, the duration of a… ▽ More

    Submitted 19 April, 2012; originally announced April 2012.

    Comments: This version supercedes the short version of this paper published in the proceedings of WWW 2012

    ACM Class: J.4

  48. arXiv:1204.1754  [pdf, other

    cs.DB cs.DC

    Vision Paper: Towards an Understanding of the Limits of Map-Reduce Computation

    Authors: Foto N. Afrati, Anish Das Sarma, Semih Salihoglu, Jeffrey D. Ullman

    Abstract: A significant amount of recent research work has addressed the problem of solving various data management problems in the cloud. The major algorithmic challenges in map-reduce computations involve balancing a multitude of factors such as the number of machines available for mappers/reducers, their memory requirements, and communication cost (total amount of data sent from mappers to reducers). Mos… ▽ More

    Submitted 8 April, 2012; originally announced April 2012.

    Comments: 5 pages

  49. arXiv:1203.5387  [pdf, ps, other

    cs.DS cs.DB

    Finding Connected Components on Map-reduce in Logarithmic Rounds

    Authors: Vibhor Rastogi, Ashwin Machanavajjhala, Laukik Chitnis, Anish Das Sarma

    Abstract: Given a large graph G = (V,E) with millions of nodes and edges, how do we compute its connected components efficiently? Recent work addresses this problem in map-reduce, where a fundamental trade-off exists between the number of map-reduce rounds and the communication of each round. Denoting d the diameter of the graph, and n the number of nodes in the largest component, all prior map-reduce techn… ▽ More

    Submitted 12 November, 2012; v1 submitted 24 March, 2012; originally announced March 2012.

  50. arXiv:1201.1363  [pdf, other

    cs.DC cs.DS

    Near-Optimal Random Walk Sampling in Distributed Networks

    Authors: Atish Das Sarma, Anisur Rahaman Molla, Gopal Pandurangan

    Abstract: Performing random walks in networks is a fundamental primitive that has found numerous applications in communication networks such as token management, load balancing, network topology discovery and construction, search, and peer-to-peer membership management. While several such algorithms are ubiquitous, and use numerous random walk samples, the walks themselves have always been performed naively… ▽ More

    Submitted 11 January, 2012; v1 submitted 6 January, 2012; originally announced January 2012.