Zum Hauptinhalt springen

Showing 1–24 of 24 results for author: Leung, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13663  [pdf, other

    stat.CO cs.NE

    Studying the Performance of the Jellyfish Search Optimiser for the Application of Projection Pursuit

    Authors: H. Sherry Zhang, Dianne Cook, Nicolas Langrené, Jessica Wai Yin Leung

    Abstract: The projection pursuit (PP) guided tour interactively optimises a criteria function known as the PP index, to explore high-dimensional data by revealing interesting projections. The optimisation in PP can be non-trivial, involving non-smooth functions and optima with a small squint angle, detectable only from close proximity. To address these challenges, this study investigates the performance of… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:2404.10242  [pdf, other

    cs.CV cs.AI cs.LG

    Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology

    Authors: Oren Kraus, Kian Kenyon-Dean, Saber Saberian, Maryam Fallah, Peter McLean, Jess Leung, Vasudev Sharma, Ayla Khan, Jia Balakrishnan, Safiye Celik, Dominique Beaini, Maciej Sypetkowski, Chi Vicky Cheng, Kristen Morse, Maureen Makes, Ben Mabey, Berton Earnshaw

    Abstract: Featurizing microscopy images for use in biological research remains a significant challenge, especially for large-scale experiments spanning millions of images. This work explores the scaling properties of weakly supervised classifiers and self-supervised masked autoencoders (MAEs) when training with increasingly larger model backbones and microscopy datasets. Our results show that ViT-based MAEs… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 Highlight. arXiv admin note: text overlap with arXiv:2309.16064

  3. arXiv:2401.14043  [pdf, other

    cs.CL cs.AI

    Towards Goal-oriented Prompt Engineering for Large Language Models: A Survey

    Authors: Haochen Li, Jonathan Leung, Zhiqi Shen

    Abstract: Large Language Models (LLMs) have shown prominent performance in various downstream tasks and prompt engineering plays a pivotal role in optimizing LLMs' performance. This paper, not only as an overview of current prompt engineering methods, but also aims to highlight the limitation of designing prompts based on an anthropomorphic assumption that expects LLMs to think like humans. From our review… ▽ More

    Submitted 17 June, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: An up-to-date resource including papers and tasks is maintained at https://github.com/Alex-HaochenLi/Goal-oriented-Prompt-Engineering

  4. arXiv:2309.16064  [pdf, other

    cs.CV cs.AI cs.LG

    Masked Autoencoders are Scalable Learners of Cellular Morphology

    Authors: Oren Kraus, Kian Kenyon-Dean, Saber Saberian, Maryam Fallah, Peter McLean, Jess Leung, Vasudev Sharma, Ayla Khan, Jia Balakrishnan, Safiye Celik, Maciej Sypetkowski, Chi Vicky Cheng, Kristen Morse, Maureen Makes, Ben Mabey, Berton Earnshaw

    Abstract: Inferring biological relationships from cellular phenotypes in high-content microscopy screens provides significant opportunity and challenge in biological research. Prior results have shown that deep vision models can capture biological signal better than hand-crafted features. This work explores how self-supervised deep learning approaches scale when training larger models on larger microscopy d… ▽ More

    Submitted 27 November, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Spotlight at NeurIPS 2023 Generative AI and Biology (GenBio) Workshop

  5. arXiv:2307.03718  [pdf, other

    cs.CY cs.AI

    Frontier AI Regulation: Managing Emerging Risks to Public Safety

    Authors: Markus Anderljung, Joslyn Barnhart, Anton Korinek, Jade Leung, Cullen O'Keefe, Jess Whittlestone, Shahar Avin, Miles Brundage, Justin Bullock, Duncan Cass-Beggs, Ben Chang, Tantum Collins, Tim Fist, Gillian Hadfield, Alan Hayes, Lewis Ho, Sara Hooker, Eric Horvitz, Noam Kolt, Jonas Schuett, Yonadav Shavit, Divya Siddarth, Robert Trager, Kevin Wolf

    Abstract: Advanced AI models hold the promise of tremendous benefits for humanity, but society needs to proactively manage the accompanying risks. In this paper, we focus on what we term "frontier AI" models: highly capable foundation models that could possess dangerous capabilities sufficient to pose severe risks to public safety. Frontier AI models pose a distinct regulatory challenge: dangerous capabilit… ▽ More

    Submitted 7 November, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Update July 11th: - Added missing footnote back in. - Adjusted author order (mistakenly non-alphabetical among the first 6 authors) and adjusted affiliations (Jess Whittlestone's affiliation was mistagged and Gillian Hadfield had SRI added to her affiliations) Updated September 4th: Various typos

  6. arXiv:2305.15324  [pdf, other

    cs.AI

    Model evaluation for extreme risks

    Authors: Toby Shevlane, Sebastian Farquhar, Ben Garfinkel, Mary Phuong, Jess Whittlestone, Jade Leung, Daniel Kokotajlo, Nahema Marchal, Markus Anderljung, Noam Kolt, Lewis Ho, Divya Siddarth, Shahar Avin, Will Hawkins, Been Kim, Iason Gabriel, Vijay Bolina, Jack Clark, Yoshua Bengio, Paul Christiano, Allan Dafoe

    Abstract: Current approaches to building general-purpose AI systems tend to produce systems with both beneficial and harmful capabilities. Further progress in AI development could lead to capabilities that pose extreme risks, such as offensive cyber capabilities or strong manipulation skills. We explain why model evaluation is critical for addressing extreme risks. Developers must be able to identify danger… ▽ More

    Submitted 22 September, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Fixed typos; added citation

    ACM Class: K.4.1

  7. arXiv:2305.04796  [pdf

    cs.IR cs.LG

    The Application of Affective Measures in Text-based Emotion Aware Recommender Systems

    Authors: John Kalung Leung, Igor Griva, William G. Kennedy, Jason M. Kinser, Sohyun Park, Seo Young Lee

    Abstract: This paper presents an innovative approach to address the problems researchers face in Emotion Aware Recommender Systems (EARS): the difficulty and cumbersome collecting voluminously good quality emotion-tagged datasets and an effective way to protect users' emotional data privacy. Without enough good-quality emotion-tagged datasets, researchers cannot conduct repeatable affective computing resear… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  8. arXiv:2304.07787  [pdf, other

    cs.LG

    Harnessing Digital Pathology And Causal Learning To Improve Eosinophilic Esophagitis Dietary Treatment Assignment

    Authors: Eliel Aknin, Ariel Larey, Julie M. Caldwell, Margaret H. Collins, Juan P. Abonia, Seema S. Aceves, Nicoleta C. Arva, Mirna Chehade, Evan S. Dellon, Nirmala Gonsalves, Sandeep K. Gupta, John Leung, Kathryn A. Peterson, Tetsuo Shoda, Jonathan M. Spergel, Marc E. Rothenberg, Yonatan Savir

    Abstract: Eosinophilic esophagitis (EoE) is a chronic, food antigen-driven, allergic inflammatory condition of the esophagus associated with elevated esophageal eosinophils. EoE is a top cause of chronic dysphagia after GERD. Diagnosis of EoE relies on counting eosinophils in histological slides, a manual and time-consuming task that limits the ability to extract complex patient-dependent features. The trea… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

    Comments: 11 pages, 5 figures

  9. arXiv:2303.08774  [pdf, other

    cs.CL cs.AI

    GPT-4 Technical Report

    Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko , et al. (256 additional authors not shown)

    Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo… ▽ More

    Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 100 pages; updated authors list; fixed author names and added citation

  10. arXiv:2112.07858  [pdf, other

    cs.HC cs.IR cs.LG

    EDAssistant: Supporting Exploratory Data Analysis in Computational Notebooks with In-Situ Code Search and Recommendation

    Authors: Xingjun Li, Yizhi Zhang, Justin Leung, Chengnian Sun, Jian Zhao

    Abstract: Using computational notebooks (e.g., Jupyter Notebook), data scientists rationalize their exploratory data analysis (EDA) based on their prior experience and external knowledge such as online examples. For novices or data scientists who lack specific knowledge about the dataset or problem to investigate, effectively obtaining and understanding the external information is critical to carry out EDA.… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  11. An Affective Aware Pseudo Association Method to Connect Disjoint Users Across Multiple Datasets -- An Enhanced Validation Method for Text-based Emotion Aware Recommender

    Authors: John Kalung Leung, Igor Griva, William G. Kennedy

    Abstract: We derive a method to enhance the evaluation for a text-based Emotion Aware Recommender that we have developed. However, we did not implement a suitable way to assess the top-N recommendations subjectively. In this study, we introduce an emotion-aware Pseudo Association Method to interconnect disjointed users across different datasets so data files can be combined to form a more extensive data fil… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

    Comments: 21 pages, 9 tables. arXiv admin note: substantial text overlap with arXiv:2007.01455

    Journal ref: International Journal on Natural Language Computing (IJNLC) Vol. 9, No. 4, August 2020

  12. Applying the Affective Aware Pseudo Association Method to Enhance the Top-N Recommendations Distribution to Users in Group Emotion Recommender Systems

    Authors: John Kalung Leung, Igor Griva, William G. Kennedy

    Abstract: Recommender Systems are a subclass of information retrieval systems, or more succinctly, a class of information filtering systems that seeks to predict how close is the match of the user's preference to a recommended item. A common approach for making recommendations for a user group is to extend Personalized Recommender Systems' capability. This approach gives the impression that group recommenda… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: 19 pages, 9 tables

    Journal ref: International Journal on Natural Language Computing (IJNLC) Vol. 10, No. 1, February 2021

  13. arXiv:2101.02768  [pdf, other

    cs.HC cs.SE

    EmoconLite: Bridging the Gap Between Emotiv and Play for Children With Severe Disabilities

    Authors: Javad Rahimipour Anaraki, Chelsea Anne Rauh, Jason Leung, Tom Chau

    Abstract: Brain-computer interfaces (BCIs) allow users to control computer applications by modulating their brain activity. Since BCIs rely solely on brain activity, they have enormous potential as an alternative access method for engaging children with severe disabilities and/or medical complexities in therapeutic recreation and leisure. In particular, one commercially available BCI platform is the Emotiv… ▽ More

    Submitted 25 May, 2021; v1 submitted 7 January, 2021; originally announced January 2021.

    Comments: 12 pages, 3 figures

    MSC Class: 92C55; 68-04 ACM Class: H.5.2; I.5.4

  14. Making Cross-Domain Recommendations by Associating Disjoint Users and Items Through the Affective Aware Pseudo Association Method

    Authors: John Kalung Leung, Igor Griva, William G. Kennedy

    Abstract: This paper utilizes an ingenious text-based affective aware pseudo association method (AAPAM) to link disjoint users and items across different information domains and leverage them to make cross-domain content-based and collaborative filtering recommendations. This paper demonstrates that the AAPAM method could seamlessly join different information domain datasets to act as one without any additi… ▽ More

    Submitted 10 February, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: 17 pages, 8 tables, 4 figures and paper has been accepted by the 2nd International Conference on Natural Language Processing, Information Retrieval and AI (NIAI 2021) to be held on January 23~24, 2021 in Zurich, Switzerland

    Journal ref: David C. Wyld et al. (Eds): AIAP, SIGML, CNSA, NIAI - 2021 pp. 113-129, 2021. CS & IT - CSCP 2021

  15. arXiv:2010.04755  [pdf, other

    cs.CL

    Investigating Cross-Linguistic Adjective Ordering Tendencies with a Latent-Variable Model

    Authors: Jun Yen Leung, Guy Emerson, Ryan Cotterell

    Abstract: Across languages, multiple consecutive adjectives modifying a noun (e.g. "the big red dog") follow certain unmarked ordering rules. While explanatory accounts have been put forward, much of the work done in this area has relied primarily on the intuitive judgment of native speakers, rather than on corpus data. We present the first purely corpus-driven model of multi-lingual adjective ordering in t… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: 13 pages, 7 tables, 1 figure. To be published in EMNLP 2020 proceedings

  16. arXiv:2009.00550  [pdf, other

    cs.SI

    Using Social Networks to Improve Group Transition Prediction in Professional Sports

    Authors: Emily J. Evans, Rebecca Jones, Joseph Leung, Benjamin Z. Webb

    Abstract: We examine whether social data can be used to predict how members of Major League Baseball (MLB) and members of the National Basketball Association (NBA) transition between teams during their career. We find that incorporating social data into various machine learning algorithms substantially improves the algorithms' ability to correctly determine these transitions. In particular, we measure how p… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

    MSC Class: 91D30; 62P25; 91B40

  17. Counterfactual Explanations for Machine Learning on Multivariate Time Series Data

    Authors: Emre Ates, Burak Aksar, Vitus J. Leung, Ayse K. Coskun

    Abstract: Applying machine learning (ML) on multivariate time series data has growing popularity in many application domains, including in computer system management. For example, recent high performance computing (HPC) research proposes a variety of ML frameworks that use system telemetry data in the form of multivariate time series so as to detect performance variations, perform intelligent scheduling or… ▽ More

    Submitted 24 August, 2020; originally announced August 2020.

    Journal ref: 2021 International Conference on Applied Artificial Intelligence (ICAPAI), 2021, pp. 1-8

  18. Text-based Emotion Aware Recommender

    Authors: John Kalung Leung, Igor Griva, William G. Kennedy

    Abstract: We apply the concept of users' emotion vectors (UVECs) and movies' emotion vectors (MVECs) as building components of Emotion Aware Recommender System. We built a comparative platform that consists of five recommenders based on content-based and collaborative filtering algorithms. We employed a Tweets Affective Classifier to classify movies' emotion profiles through movie overviews. We construct MV… ▽ More

    Submitted 28 July, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: 13 pages, 8 tables, International Conference on Natural Language Computing and AI (NLCAI2020) July25-26, London, United Kingdom

    Journal ref: David C. Wyld et al. (Eds): CCSEA, BIoT, DKMP, CLOUD, NLCAI, SIPRO - 2020 pp. 101-114, 2020

  19. Using Affective Features from Media Content Metadata for Better Movie Recommendations

    Authors: John Kalung Leung, Igor Griva, William G. Kennedy

    Abstract: This paper investigates the causality in the decision making of movie recommendations through the users' affective profiles. We advocate a method of assigning emotional tags to a movie by the auto-detection of the affective features in the movie's overview. We apply a text-based Emotion Detection and Recognition model, which trained by tweets short messages and transfers the learned model to detec… ▽ More

    Submitted 10 February, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: 8 pages, 10 tables, 1 figure, and presented in KDIR2020 Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management

    Journal ref: In Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, 161-168, 2020

  20. arXiv:2004.07213  [pdf, ps, other

    cs.CY

    Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims

    Authors: Miles Brundage, Shahar Avin, Jasmine Wang, Haydn Belfield, Gretchen Krueger, Gillian Hadfield, Heidy Khlaaf, Jingying Yang, Helen Toner, Ruth Fong, Tegan Maharaj, Pang Wei Koh, Sara Hooker, Jade Leung, Andrew Trask, Emma Bluemke, Jonathan Lebensold, Cullen O'Keefe, Mark Koren, Théo Ryffel, JB Rubinovitz, Tamay Besiroglu, Federica Carugati, Jack Clark, Peter Eckersley , et al. (34 additional authors not shown)

    Abstract: With the recent wave of progress in artificial intelligence (AI) has come a growing awareness of the large-scale impacts of AI systems, and recognition that existing regulations and norms in industry and academia are insufficient to ensure responsible AI development. In order for AI developers to earn trust from system users, customers, civil society, governments, and other stakeholders that they… ▽ More

    Submitted 20 April, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

  21. arXiv:1912.11595  [pdf

    cs.CY cs.AI

    The Windfall Clause: Distributing the Benefits of AI for the Common Good

    Authors: Cullen O'Keefe, Peter Cihon, Ben Garfinkel, Carrick Flynn, Jade Leung, Allan Dafoe

    Abstract: As the transformative potential of AI has become increasingly salient as a matter of public and political interest, there has been growing discussion about the need to ensure that AI broadly benefits humanity. This in turn has spurred debate on the social responsibilities of large technology companies to serve the interests of society at large. In response, ethical principles and codes of conduct… ▽ More

    Submitted 24 January, 2020; v1 submitted 25 December, 2019; originally announced December 2019.

    Comments: Short version to be published in proceedings of AIES

  22. arXiv:1604.05213  [pdf, other

    physics.med-ph cs.CV physics.optics

    Non-contact hemodynamic imaging reveals the jugular venous pulse waveform

    Authors: Robert Amelard, Richard L Hughson, Danielle K Greaves, Kaylen J Pfisterer, Jason Leung, David A Clausi, Alexander Wong

    Abstract: Cardiovascular monitoring is important to prevent diseases from progressing. The jugular venous pulse (JVP) waveform offers important clinical information about cardiac health, but is not routinely examined due to its invasive catheterisation procedure. Here, we demonstrate for the first time that the JVP can be consistently observed in a non-contact manner using a novel light-based photoplethysmo… ▽ More

    Submitted 21 April, 2016; v1 submitted 15 April, 2016; originally announced April 2016.

    Comments: 10 pages, 8 figures

  23. arXiv:0712.1854  [pdf

    cs.NI cs.PF

    Back-of-the-Envelope Computation of Throughput Distributions in CSMA Wireless Networks

    Authors: S. C. Liew, C. Kai, J. Leung, B. Wong

    Abstract: This work started out with our accidental discovery of a pattern of throughput distributions among links in IEEE 802.11 networks from experimental results. This pattern gives rise to an easy computation method, which we term back-of-the-envelop (BoE) computation, because for many network configurations, very accurate results can be obtained within minutes, if not seconds, by simple hand computat… ▽ More

    Submitted 11 December, 2007; originally announced December 2007.

  24. arXiv:cs/0407058  [pdf, ps, other

    cs.DS cs.DC

    Communication-Aware Processor Allocation for Supercomputers

    Authors: Michael A. Bender, David P. Bunde, Erik D. Demaine, Sandor P. Fekete, Vitus J. Leung, Henk Meijer, Cynthia A. Phillips

    Abstract: This paper gives processor-allocation algorithms for minimizing the average number of communication hops between the assigned processors for grid architectures, in the presence of occupied cells. The simpler problem of assigning processors on a free grid has been studied by Karp, McKellar, and Wong who show that the solutions have nontrivial structure; they left open the complexity of the proble… ▽ More

    Submitted 6 December, 2005; v1 submitted 24 July, 2004; originally announced July 2004.

    Comments: 19 pages, 7 figures, 1 table, Latex, submitted for journal publication. Previous version is extended abstract (14 pages), appeared in Proceedings WADS, Springer LNCS 3608, pp. 169-181

    ACM Class: F.2.2; C.1.4