Search | arXiv e-print repository

The Tracking Machine Learning challenge : Throughput phase

Authors: Sabrina Amrouche, Laurent Basara, Paolo Calafiura, Dmitry Emeliyanov, Victor Estrade, Steven Farrell, Cécile Germain, Vladimir Vava Gligorov, Tobias Golling, Sergey Gorbunov, Heather Gray, Isabelle Guyon, Mikhail Hushchyn, Vincenzo Innocente, Moritz Kiehn, Marcel Kunze, Edward Moyse, David Rousseau, Andreas Salzburger, Andrey Ustyuzhanin, Jean-Roch Vlimant

Abstract: This paper reports on the second "Throughput" phase of the Tracking Machine Learning (TrackML) challenge on the Codalab platform. As in the first "Accuracy" phase, the participants had to solve a difficult experimental problem linked to tracking accurately the trajectory of particles as e.g. created at the Large Hadron Collider (LHC): given O($10^5$) points, the participants had to connect them into O($10^4$) individual groups that represent the particle trajectories, which are approximately helical. While in the first phase only the accuracy mattered, the goal of this second phase was a compromise between the accuracy and the speed of inference. Both were measured on the Codalab platform where the participants had to upload their software. The best three participants had solutions with good accuracy and speed an order of magnitude faster than the state of the art when the challenge was designed. Although the core algorithms were less diverse than in the first phase, a diversity of techniques has been used and is described in this paper. The performance of the algorithms is analysed in depth and lessons derived.

Submitted 14 May, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

Comments: submitted to Computing and Software for Big Science

arXiv:2103.16705 [pdf]

doi 10.1016/j.ijcci.2021.100273

Designing Building Blocks for Open-Ended Early Literacy Software

Authors: Ivan Sysoev, James H. Gray, Susan Fine, Deb Roy

Abstract: English has a convoluted relationship between its pronunciation and spelling, which obscures its phonological structure for early literacy learners. This convoluted relationship has implications for early literacy software, particularly for open-ended, child-driven designs. A tempting way to bypass this issue is to use manipulables (blocks) that are directly tied to phonemes. However, creating phoneme-based blocks leads to two design challenges: (a) how to represent phonemes visually in a child-accessible way and (b) how to account for context-dependent spelling. In the present work, we approached these challenges by developing a set of animated, onomatopoeia-based mnemonic characters, one per phoneme, that can take the shape of different graphemes. We applied the characters to a construction-based literacy app to simplify independent word-building for literacy beginners. We tested the app during a 13-week-long period with 4- to 5-year-olds in kindergarten classrooms. Children showed visible interest in the characters and properly grasped the principles of their functioning. However, the blocks were not sufficient to scaffold independent word building, leading children to rely on other scaffolding mechanisms. To test the characters' efficiency as mnemonics, we evaluated their effect on the speed and accuracy of finding phonemes on a keyboard. The results suggest that there were both children who benefitted from the characters in this task and those who performed better without them. The factors that differentiated these two categories are currently unclear. To help further research on phonetic mnemonics in literacy learning software, we are making the characters available to the research community.

Submitted 30 March, 2021; originally announced March 2021.

Comments: This is a published manuscript for the paper published in the International Journal of Child-Computer Interaction. Sharing on ArXiv is in accordance with Elsevier sharing policy

Journal ref: International Journal of Child-Computer Interaction. Article 100273 (2021)

arXiv:2103.14737 [pdf, ps, other]

Porting HEP Parameterized Calorimeter Simulation Code to GPUs

Authors: Zhihua Dong, Heather Gray, Charles Leggett, Meifeng Lin, Vincent R. Pascuzzi, Kwangmin Yu

Abstract: The High Energy Physics (HEP) experiments, such as those at the Large Hadron Collider (LHC), traditionally consume large amounts of CPU cycles for detector simulations and data analysis, but rarely use compute accelerators such as GPUs. As the LHC is upgraded to allow for higher luminosity, resulting in much higher data rates, purely relying on CPUs may not provide enough computing power to support the simulation and data analysis needs. As a proof of concept, we investigate the feasibility of porting a HEP parameterized calorimeter simulation code to GPUs. We have chosen to use FastCaloSim, the ATLAS fast parametrized calorimeter simulation. While FastCaloSim is sufficiently fast such that it does not impose a bottleneck in detector simulations overall, significant speed-ups in the processing of large samples can be achieved from GPU parallelization at both the particle (intra-event) and event levels; this is especially beneficial in conditions expected at the high-luminosity LHC, where extremely high per-event particle multiplicities will result from the many simultaneous proton-proton collisions. We report our experience with porting FastCaloSim to NVIDIA GPUs using CUDA. A preliminary Kokkos implementation of FastCaloSim for portability to other parallel architectures is also described.

Submitted 18 May, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

Comments: 15 pages, 1 figure, 8 tables, 2 listings, submitted to Frontiers in Big Data (Big Data in AI and High Energy Physics)

arXiv:1812.04901 [pdf]

Automatic individual pig detection and tracking in surveillance videos

Authors: Lei Zhang, Helen Gray, Xujiong Ye, Lisa Collins, Nigel Allinson

Abstract: Individual pig detection and tracking is an important requirement in many video-based pig monitoring applications. However, it remains a challenging task in complex scenes, due to problems of light fluctuation, similar appearances of pigs, shape deformations and occlusions. To tackle these problems, we propose a robust real-time multiple pig detection and tracking method which does not require manual marking or physical identification of the pigs, and works under both daylight and infrared light conditions. Our method couples a CNN-based detector and a correlation filter-based tracker via a novel hierarchical data association algorithm. The detector gains the best accuracy/speed trade-off by using the features derived from multiple layers at different scales in a one-stage prediction network. We define a tag-box for each pig as the tracking target, and the multiple object tracking is conducted in a key-points tracking manner using learned correlation filters. Under challenging conditions, the tracking failures are modelled based on the relations between responses of detector and tracker, and the data association algorithm allows the detection hypotheses to be refined, while the drifted tracks can be corrected by probing the tracking failures followed by the re-initialization of tracking. As a result, the optimal tracklets can sequentially grow with on-line refined detections, and tracking fragments are correctly integrated into respective tracks while keeping the original identifications. Experiments with a dataset captured from a commercial farm show that our method can robustly detect and track multiple pigs under challenging conditions. The promising performance of the proposed method also demonstrates the feasibility of long-term individual pig tracking in a complex environment and thus promises commercial potential.

Submitted 12 December, 2018; originally announced December 2018.

Comments: 19 pages, 10 figures, 3 tables

Showing 1–4 of 4 results for author: Gray, H