Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Phan, J

Searching in archive cs. Search in all archives.
.
  1. BugsInPy: A Database of Existing Bugs in Python Programs to Enable Controlled Testing and Debugging Studies

    Authors: Ratnadira Widyasari, Sheng Qin Sim, Camellia Lok, Haodi Qi, Jack Phan, Qijin Tay, Constance Tan, Fiona Wee, Jodie Ethelda Tan, Yuheng Yieh, Brian Goh, Ferdian Thung, Hong Jin Kang, Thong Hoang, David Lo, Eng Lieh Ouh

    Abstract: The 2019 edition of Stack Overflow developer survey highlights that, for the first time, Python outperformed Java in terms of popularity. The gap between Python and Java further widened in the 2020 edition of the survey. Unfortunately, despite the rapid increase in Python's popularity, there are not many testing and debugging tools that are designed for Python. This is in stark contrast with the a… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Journal ref: Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (2020) 1556-1560

  2. arXiv:2303.06286  [pdf, other

    cs.SE

    NICHE: A Curated Dataset of Engineered Machine Learning Projects in Python

    Authors: Ratnadira Widyasari, Zhou Yang, Ferdian Thung, Sheng Qin Sim, Fiona Wee, Camellia Lok, Jack Phan, Haodi Qi, Constance Tan, Qijin Tay, David Lo

    Abstract: Machine learning (ML) has gained much attention and been incorporated into our daily lives. While there are numerous publicly available ML projects on open source platforms such as GitHub, there have been limited attempts in filtering those projects to curate ML projects of high quality. The limited availability of such a high-quality dataset poses an obstacle in understanding ML projects. To help… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted by MSR 2023

  3. arXiv:1905.09247  [pdf, ps, other

    cs.CV

    Dual Active Sampling on Batch-Incremental Active Learning

    Authors: Johan Phan, Massimiliano Ruocco, Francesco Scibilia

    Abstract: Recently, Convolutional Neural Networks (CNNs) have shown unprecedented success in the field of computer vision, especially on challenging image classification tasks by relying on a universal approach, i.e., training a deep model on a massive dataset of supervised examples. While unlabeled data are often an abundant resource, collecting a large set of labeled data, on the other hand, are very expe… ▽ More

    Submitted 22 May, 2019; originally announced May 2019.

    Comments: 6 pages

  4. arXiv:1611.05751  [pdf, other

    cs.LG stat.ML

    A Multi-Modal Graph-Based Semi-Supervised Pipeline for Predicting Cancer Survival

    Authors: Hamid Reza Hassanzadeh, John H. Phan, May D. Wang

    Abstract: Cancer survival prediction is an active area of research that can help prevent unnecessary therapies and improve patient's quality of life. Gene expression profiling is being widely used in cancer studies to discover informative biomarkers that aid predict different clinical endpoint prediction. We use multiple modalities of data derived from RNA deep-sequencing (RNA-seq) to predict survival of ca… ▽ More

    Submitted 17 November, 2016; originally announced November 2016.

    Comments: in 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

  5. arXiv:1509.08888  [pdf

    cs.LG

    A Semi-Supervised Method for Predicting Cancer Survival Using Incomplete Clinical Data

    Authors: Hamid Reza Hassanzadeh, John H. Phan, May D. Wang

    Abstract: Prediction of survival for cancer patients is an open area of research. However, many of these studies focus on datasets with a large number of patients. We present a novel method that is specifically designed to address the challenge of data scarcity, which is often the case for cancer datasets. Our method is able to use unlabeled data to improve classification by adopting a semi-supervised train… ▽ More

    Submitted 29 September, 2015; originally announced September 2015.