Showing 1–16 of 16 results for author: Whitehill, J

  1. arXiv:2406.06582  [pdf, ps, other]

    cs.CL cs.LG eess.AS

    Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing

    Authors: Viet Anh Trinh, Rosy Southwell, Yiwen Guan, Xinlu He, Zhiyong Wang, Jacob Whitehill

    Abstract: Recent work on discrete speech tokenization has paved the way for models that can seamlessly perform multiple tasks across modalities, e.g., speech recognition, text to speech, speech to speech translation. Moreover, large language models (LLMs) pretrained from vast text corpora contain rich linguistic information that can improve accuracy in a variety of tasks. In this paper, we present a decoder…

    Submitted 25 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2310.01132  [pdf, other]

    cs.CL cs.AI

    Automated Evaluation of Classroom Instructional Support with LLMs and BoWs: Connecting Global Predictions to Specific Feedback

    Authors: Jacob Whitehill, Jennifer LoCasale-Crouch

    Abstract: With the aim to provide teachers with more specific, frequent, and actionable feedback about their teaching, we explore how Large Language Models (LLMs) can be used to estimate "Instructional Support" domain scores of the CLassroom Assessment Scoring System (CLASS), a widely used observation protocol. We design a machine learning architecture that uses either zero-shot prompting of Meta's Llama2…

    Submitted 16 April, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

  3. Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification

    Authors: Zeqian Li, Xinlu He, Jacob Whitehill

    Abstract: We consider a novel clustering task in which clusters can have compositional relationships, e.g., one cluster contains images of rectangles, one contains images of circles, and a third (compositional) cluster contains images with both objects. In contrast to hierarchical clustering in which a parent cluster represents the intersection of properties of the child clusters, our problem is about findi…

    Submitted 21 July, 2023; v1 submitted 9 September, 2021; originally announced September 2021.

  4. arXiv:2103.03862  [pdf, other]

    cs.CV cs.AI cs.CG cs.LG

    Harnessing Geometric Constraints from Emotion Labels to improve Face Verification

    Authors: Anand Ramakrishnan, Minh Pham, Jacob Whitehill

    Abstract: For the task of face verification, we explore the utility of harnessing auxiliary facial emotion labels to impose explicit geometric constraints on the embedding space when training deep embedding models. We introduce several novel loss functions that, in conjunction with a standard Triplet Loss [43], or ArcFace loss [10], provide geometric constraints on the embedding space; the labels for our lo…

    Submitted 22 July, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: 8 pages, 3 figures, 2 tables

  5. arXiv:2010.11803  [pdf, other]

    cs.SD cs.CL eess.AS

    Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers

    Authors: Zeqian Li, Jacob Whitehill

    Abstract: We propose a new method for speaker diarization that can handle overlapping speech with 2+ people. Our method is based on compositional embeddings [1]: Like standard speaker embedding methods such as x-vector [2], compositional embedding models contain a function f that separates speech from different speakers. In addition, they include a composition function g to compute set-union operations in t…

    Submitted 10 February, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

  6. arXiv:2005.09525

    cs.CV cs.LG cs.SD eess.AS

    Toward Automated Classroom Observation: Multimodal Machine Learning to Estimate CLASS Positive Climate and Negative Climate

    Authors: Anand Ramakrishnan, Brian Zylich, Erin Ottmar, Jennifer LoCasale-Crouch, Jacob Whitehill

    Abstract: In this work we present a multi-modal machine learning-based system, which we call ACORN, to analyze videos of school classrooms for the Positive Climate (PC) and Negative Climate (NC) dimensions of the CLASS observation protocol that is widely used in educational research. ACORN uses convolutional neural networks to analyze spectral audio features, the faces of teachers and students, and the pixe…

    Submitted 23 July, 2021; v1 submitted 19 May, 2020; originally announced May 2020.

    Comments: The authors discovered that the results are not reproducible

    Journal ref: IEEE Transactions on Affective Computing, 2021

  7. arXiv:2002.05242  [pdf, other]

    cs.CV cs.HC cs.LG

    Leveraging Affect Transfer Learning for Behavior Prediction in an Intelligent Tutoring System

    Authors: Nataniel Ruiz, Hao Yu, Danielle A. Allessio, Mona Jalal, Ajjen Joshi, Thomas Murray, John J. Magee, Jacob R. Whitehill, Vitaly Ablavsky, Ivon Arroyo, Beverly P. Woolf, Stan Sclaroff, Margrit Betke

    Abstract: In this work, we propose a video-based transfer learning approach for predicting problem outcomes of students working with an intelligent tutoring system (ITS). By analyzing a student's face and gestures, our method predicts the outcome of a student answering a problem in an ITS from a video feed. Our work is motivated by the reasoning that the ability to predict such outcomes enables tutoring sys…

    Submitted 8 April, 2022; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Published at IEEE International Conference on Automatic Face and Gesture Recognition (FG), 2021 - Best Poster Award (4% award rate)

  8. arXiv:2002.04193  [pdf, other]

    cs.LG stat.ML

    Compositional Embeddings for Multi-Label One-Shot Learning

    Authors: Zeqian Li, Michael C. Mozer, Jacob Whitehill

    Abstract: We present a compositional embedding framework that infers not just a single class per input image, but a set of classes, in the setting of one-shot learning. Specifically, we propose and evaluate several novel models consisting of (1) an embedding function f trained jointly with a "composition" function g that computes set union operations between the classes encoded in two embedding vectors; and…

    Submitted 13 November, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

  9. arXiv:1812.08255  [pdf, other]

    cs.LG stat.ML

    Automatic Classifiers as Scientific Instruments: One Step Further Away from Ground-Truth

    Authors: Jacob Whitehill, Anand Ramakrishnan

    Abstract: Automatic machine learning-based detectors of various psychological and social phenomena (e.g., emotion, stress, engagement) have great potential to advance basic science. However, when a detector $d$ is trained to approximate an existing measurement tool (e.g., a questionnaire, observation protocol), then care must be taken when interpreting measurements collected using $d$ since they are one ste…

    Submitted 4 May, 2019; v1 submitted 19 December, 2018; originally announced December 2018.

  10. arXiv:1709.02418  [pdf, other]

    cs.LG

    How Does Knowledge of the AUC Constrain the Set of Possible Ground-truth Labelings?

    Authors: Jacob Whitehill

    Abstract: Recent work on privacy-preserving machine learning has considered how data-mining competitions such as Kaggle could potentially be "hacked", either intentionally or inadvertently, by using information from an oracle that reports a classifier's accuracy on the test set. For binary classification tasks in particular, one of the most common accuracy metrics is the Area Under the ROC Curve (AUC), and…

    Submitted 11 September, 2017; v1 submitted 7 September, 2017; originally announced September 2017.

  11. arXiv:1707.01825  [pdf, other]

    cs.LG

    Climbing the Kaggle Leaderboard by Exploiting the Log-Loss Oracle

    Authors: Jacob Whitehill

    Abstract: In the context of data-mining competitions (e.g., Kaggle, KDDCup, ILSVRC Challenge), we show how access to an oracle that reports a contestant's log-loss score on the test set can be exploited to deduce the ground-truth of some of the test examples. By applying this technique iteratively to batches of $m$ examples (for small $m$), all of the test labels can eventually be inferred. In this paper, (…

    Submitted 6 July, 2017; originally announced July 2017.

  12. arXiv:1702.06404  [pdf, other]

    cs.AI cs.CY

    Delving Deeper into MOOC Student Dropout Prediction

    Authors: Jacob Whitehill, Kiran Mohan, Daniel Seaton, Yigal Rosen, Dustin Tingley

    Abstract: In order to obtain reliable accuracy estimates for automatic MOOC dropout predictors, it is important to train and test them in a manner consistent with how they will be used in practice. Yet most prior research on MOOC dropout prediction has measured test accuracy on the same course used for training the classifier, which can lead to overly optimistic accuracy estimates. In order to understand be…

    Submitted 21 February, 2017; originally announced February 2017.

  13. arXiv:1606.09610  [pdf, other]

    cs.HC cs.CY

    A Crowdsourcing Approach To Collecting Tutorial Videos -- Toward Personalized Learning-at-Scale

    Authors: Jacob Whitehill, Margo Seltzer

    Abstract: We investigated the feasibility of crowdsourcing full-fledged tutorial videos from ordinary people on the Web on how to solve math problems related to logarithms. This kind of approach (a form of learnersourcing) to efficiently collecting tutorial videos and other learning resources could be useful for realizing personalized learning-at-scale, whereby students receive specific learning resources -…

    Submitted 22 April, 2017; v1 submitted 30 June, 2016; originally announced June 2016.

  14. arXiv:1506.01339  [pdf, other]

    cs.LG

    Exploiting an Oracle that Reports AUC Scores in Machine Learning Contests

    Authors: Jacob Whitehill

    Abstract: In machine learning contests such as the ImageNet Large Scale Visual Recognition Challenge and the KDD Cup, contestants can submit candidate solutions and receive from an oracle (typically the organizers of the competition) the accuracy of their guesses compared to the ground-truth labels. One of the most commonly used accuracy metrics for binary classification tasks is the Area Under the Receiver…

    Submitted 13 November, 2015; v1 submitted 3 June, 2015; originally announced June 2015.

  15. arXiv:1306.0125  [pdf, other]

    cs.LG

    Understanding ACT-R - an Outsider's Perspective

    Authors: Jacob Whitehill

    Abstract: The ACT-R theory of cognition developed by John Anderson and colleagues endeavors to explain how humans recall chunks of information and how they solve problems. ACT-R also serves as a theoretical basis for "cognitive tutors", i.e., automatic tutoring systems that help students learn mathematics, computer programming, and other subjects. The official ACT-R definition is distributed across a large…

    Submitted 1 June, 2013; originally announced June 2013.

  16. arXiv:1110.0585  [pdf, other]

    cs.CV

    Discriminately Decreasing Discriminability with Learned Image Filters

    Authors: Jacob Whitehill, Javier Movellan

    Abstract: In machine learning and computer vision, input images are often filtered to increase data discriminability. In some situations, however, one may wish to purposely decrease discriminability of one classification task (a "distractor" task), while simultaneously preserving information relevant to another (the task-of-interest): For example, it may be important to mask the identity of persons containe…

    Submitted 4 October, 2011; originally announced October 2011.