-
Integrating HCI Datasets in Project-Based Machine Learning Courses: A College-Level Review and Case Study
Authors:
Xiaodong Qu,
Matthew Key,
Eric Luo,
Chuhui Qiu
Abstract:
This study explores the integration of real-world machine learning (ML) projects using human-computer interfaces (HCI) datasets in college-level courses to enhance both teaching and learning experiences. Employing a comprehensive literature review, course websites analysis, and a detailed case study, the research identifies best practices for incorporating HCI datasets into project-based ML educat…
▽ More
This study explores the integration of real-world machine learning (ML) projects using human-computer interfaces (HCI) datasets in college-level courses to enhance both teaching and learning experiences. Employing a comprehensive literature review, course websites analysis, and a detailed case study, the research identifies best practices for incorporating HCI datasets into project-based ML education. Key f indings demonstrate increased student engagement, motivation, and skill development through hands-on projects, while instructors benefit from effective tools for teaching complex concepts. The study also addresses challenges such as data complexity and resource allocation, offering recommendations for future improvements. These insights provide a valuable framework for educators aiming to bridge the gap between
△ Less
Submitted 6 August, 2024;
originally announced August 2024.
-
The Case for Transport-Level Encryption in Datacenter Networks
Authors:
Tianyi Gao,
Xinshu Ma,
Suhas Narreddy,
Eugenio Luo,
Steven W. D. Chien,
Michio Honda
Abstract:
Cloud applications need network data encryption to isolate from other tenants and protect their data from potential eavesdroppers in the network infrastructure. This paper presents SDP, a protocol design for emerging datacenter transport protocols, such as pHost, NDP, and Homa, to integrate data encryption with the use of existing NIC offloading of cryptographic operations designed for TLS over TC…
▽ More
Cloud applications need network data encryption to isolate from other tenants and protect their data from potential eavesdroppers in the network infrastructure. This paper presents SDP, a protocol design for emerging datacenter transport protocols, such as pHost, NDP, and Homa, to integrate data encryption with the use of existing NIC offloading of cryptographic operations designed for TLS over TCP. Therefore, SDP could enable a deployment path of new transport protocols in datacenters without giving up hardware offloading support, which would otherwise make encryption on those protocols even slower than TLS over TCP. SDP is based on Homa, and outperforms TLS over TCP by up to 29 % in throughput. SDP currently supports two real-world applications, Redis, improving throughput by up to 24 %, and in-kernel NVMe-oF, cutting P99 latency by up to 21 %.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Reachability Analysis for Linear Systems with Uncertain Parameters using Polynomial Zonotopes
Authors:
Yushen Huang,
Ertai Luo,
Stanley Bak,
Yifan Sun
Abstract:
In real world applications, uncertain parameters are the rule rather than the exception. We present a reachability algorithm for linear systems with uncertain parameters and inputs using set propagation of polynomial zonotopes. In contrast to previous methods, our approach is able to tightly capture the non-convexity of the reachable set. Building up on our main result, we show how our reachabilit…
▽ More
In real world applications, uncertain parameters are the rule rather than the exception. We present a reachability algorithm for linear systems with uncertain parameters and inputs using set propagation of polynomial zonotopes. In contrast to previous methods, our approach is able to tightly capture the non-convexity of the reachable set. Building up on our main result, we show how our reachability algorithm can be extended to handle linear time-varying systems as well as linear systems with time-varying parameters. Moreover, our approach opens up new possibilities for reachability analysis of linear time-invariant systems, nonlinear systems, and hybrid systems. We compare our approach to other state of the art methods, with superior tightness on two benchmarks including a 9-dimensional vehicle platooning system. Moreover, as part of the journal extension, we investigate through a polynomial zonotope with special structure named multi-affine zonotopes and its optimization problem. We provide the corresponding optimization algorithm and experiment over the examples obatined from two benchmark systems, showing the efficiency and scalability comparing to the state of the art method for handling such type of set representation.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use
Authors:
Imad Eddine Toubal,
Aditya Avinash,
Neil Gordon Alldrin,
Jan Dlabal,
Wenlei Zhou,
Enming Luo,
Otilia Stretcu,
Hao Xiong,
Chun-Ta Lu,
Howard Zhou,
Ranjay Krishna,
Ariel Fuxman,
Tom Duerig
Abstract:
From content moderation to wildlife conservation, the number of applications that require models to recognize nuanced or subjective visual concepts is growing. Traditionally, developing classifiers for such concepts requires substantial manual effort measured in hours, days, or even months to identify and annotate data needed for training. Even with recently proposed Agile Modeling techniques, whi…
▽ More
From content moderation to wildlife conservation, the number of applications that require models to recognize nuanced or subjective visual concepts is growing. Traditionally, developing classifiers for such concepts requires substantial manual effort measured in hours, days, or even months to identify and annotate data needed for training. Even with recently proposed Agile Modeling techniques, which enable rapid bootstrapping of image classifiers, users are still required to spend 30 minutes or more of monotonous, repetitive data labeling just to train a single classifier. Drawing on Fiske's Cognitive Miser theory, we propose a new framework that alleviates manual effort by replacing human labeling with natural language interactions, reducing the total effort required to define a concept by an order of magnitude: from labeling 2,000 images to only 100 plus some natural language interactions. Our framework leverages recent advances in foundation models, both large language models and vision-language models, to carve out the concept space through conversation and by automatically labeling training data points. Most importantly, our framework eliminates the need for crowd-sourced annotations. Moreover, our framework ultimately produces lightweight classification models that are deployable in cost-sensitive scenarios. Across 15 subjective concepts and across 2 public image classification datasets, our trained models outperform traditional Agile Modeling as well as state-of-the-art zero-shot classification models like ALIGN, CLIP, CuPL, and large visual question-answering models like PaLI-X.
△ Less
Submitted 19 March, 2024; v1 submitted 4 March, 2024;
originally announced March 2024.
-
Scaling Up LLM Reviews for Google Ads Content Moderation
Authors:
Wei Qiao,
Tushar Dogra,
Otilia Stretcu,
Yu-Han Lyu,
Tiantian Fang,
Dongjin Kwon,
Chun-Ta Lu,
Enming Luo,
Yuan Wang,
Chih-Chun Chia,
Ariel Fuxman,
Fangzhou Wang,
Ranjay Krishna,
Mehmet Tek
Abstract:
Large language models (LLMs) are powerful tools for content moderation, but their inference costs and latency make them prohibitive for casual use on large datasets, such as the Google Ads repository. This study proposes a method for scaling up LLM reviews for content moderation in Google Ads. First, we use heuristics to select candidates via filtering and duplicate removal, and create clusters of…
▽ More
Large language models (LLMs) are powerful tools for content moderation, but their inference costs and latency make them prohibitive for casual use on large datasets, such as the Google Ads repository. This study proposes a method for scaling up LLM reviews for content moderation in Google Ads. First, we use heuristics to select candidates via filtering and duplicate removal, and create clusters of ads for which we select one representative ad per cluster. We then use LLMs to review only the representative ads. Finally, we propagate the LLM decisions for the representative ads back to their clusters. This method reduces the number of reviews by more than 3 orders of magnitude while achieving a 2x recall compared to a baseline non-LLM model. The success of this approach is a strong function of the representations used in clustering and label propagation; we found that cross-modal similarity representations yield better results than uni-modal representations.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
SegmentAnyBone: A Universal Model that Segments Any Bone at Any Location on MRI
Authors:
Hanxue Gu,
Roy Colglazier,
Haoyu Dong,
Jikai Zhang,
Yaqian Chen,
Zafer Yildiz,
Yuwen Chen,
Lin Li,
Jichen Yang,
Jay Willhite,
Alex M. Meyer,
Brian Guo,
Yashvi Atul Shah,
Emily Luo,
Shipra Rajput,
Sally Kuehn,
Clark Bulleit,
Kevin A. Wu,
Jisoo Lee,
Brandon Ramirez,
Darui Lu,
Jay M. Levin,
Maciej A. Mazurowski
Abstract:
Magnetic Resonance Imaging (MRI) is pivotal in radiology, offering non-invasive and high-quality insights into the human body. Precise segmentation of MRIs into different organs and tissues would be highly beneficial since it would allow for a higher level of understanding of the image content and enable important measurements, which are essential for accurate diagnosis and effective treatment pla…
▽ More
Magnetic Resonance Imaging (MRI) is pivotal in radiology, offering non-invasive and high-quality insights into the human body. Precise segmentation of MRIs into different organs and tissues would be highly beneficial since it would allow for a higher level of understanding of the image content and enable important measurements, which are essential for accurate diagnosis and effective treatment planning. Specifically, segmenting bones in MRI would allow for more quantitative assessments of musculoskeletal conditions, while such assessments are largely absent in current radiological practice. The difficulty of bone MRI segmentation is illustrated by the fact that limited algorithms are publicly available for use, and those contained in the literature typically address a specific anatomic area. In our study, we propose a versatile, publicly available deep-learning model for bone segmentation in MRI across multiple standard MRI locations. The proposed model can operate in two modes: fully automated segmentation and prompt-based segmentation. Our contributions include (1) collecting and annotating a new MRI dataset across various MRI protocols, encompassing over 300 annotated volumes and 8485 annotated slices across diverse anatomic regions; (2) investigating several standard network architectures and strategies for automated segmentation; (3) introducing SegmentAnyBone, an innovative foundational model-based approach that extends Segment Anything Model (SAM); (4) comparative analysis of our algorithm and previous approaches; and (5) generalization analysis of our algorithm across different anatomical locations and MRI sequences, as well as an external dataset. We publicly release our model at https://github.com/mazurowski-lab/SegmentAnyBone.
△ Less
Submitted 23 January, 2024;
originally announced January 2024.
-
scDiffusion: conditional generation of high-quality single-cell data using diffusion model
Authors:
Erpai Luo,
Minsheng Hao,
Lei Wei,
Xuegong Zhang
Abstract:
Single-cell RNA sequencing (scRNA-seq) data are important for studying the laws of life at single-cell level. However, it is still challenging to obtain enough high-quality scRNA-seq data. To mitigate the limited availability of data, generative models have been proposed to computationally generate synthetic scRNA-seq data. Nevertheless, the data generated with current models are not very realisti…
▽ More
Single-cell RNA sequencing (scRNA-seq) data are important for studying the laws of life at single-cell level. However, it is still challenging to obtain enough high-quality scRNA-seq data. To mitigate the limited availability of data, generative models have been proposed to computationally generate synthetic scRNA-seq data. Nevertheless, the data generated with current models are not very realistic yet, especially when we need to generate data with controlled conditions. In the meantime, the Diffusion models have shown their power in generating data at high fidelity, providing a new opportunity for scRNA-seq generation.
In this study, we developed scDiffusion, a generative model combining diffusion model and foundation model to generate high-quality scRNA-seq data with controlled conditions. We designed multiple classifiers to guide the diffusion process simultaneously, enabling scDiffusion to generate data under multiple condition combinations. We also proposed a new control strategy called Gradient Interpolation. This strategy allows the model to generate continuous trajectories of cell development from a given cell state.
Experiments showed that scDiffusion can generate single-cell gene expression data closely resembling real scRNA-seq data. Also, scDiffusion can conditionally produce data on specific cell types including rare cell types. Furthermore, we could use the multiple-condition generation of scDiffusion to generate cell type that was out of the training data. Leveraging the Gradient Interpolation strategy, we generated a continuous developmental trajectory of mouse embryonic cells. These experiments demonstrate that scDiffusion is a powerful tool for augmenting the real scRNA-seq data and can provide insights into cell fate research.
△ Less
Submitted 4 March, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models
Authors:
Yushi Hu,
Otilia Stretcu,
Chun-Ta Lu,
Krishnamurthy Viswanathan,
Kenji Hata,
Enming Luo,
Ranjay Krishna,
Ariel Fuxman
Abstract:
Solving complex visual tasks such as "Who invented the musical instrument on the right?" involves a composition of skills: understanding space, recognizing instruments, and also retrieving prior knowledge. Recent work shows promise by decomposing such tasks using a large language model (LLM) into an executable program that invokes specialized vision models. However, generated programs are error-pr…
▽ More
Solving complex visual tasks such as "Who invented the musical instrument on the right?" involves a composition of skills: understanding space, recognizing instruments, and also retrieving prior knowledge. Recent work shows promise by decomposing such tasks using a large language model (LLM) into an executable program that invokes specialized vision models. However, generated programs are error-prone: they omit necessary steps, include spurious ones, and are unable to recover when the specialized models give incorrect outputs. Moreover, they require loading multiple models, incurring high latency and computation costs. We propose Visual Program Distillation (VPD), an instruction tuning framework that produces a vision-language model (VLM) capable of solving complex visual tasks with a single forward pass. VPD distills the reasoning ability of LLMs by using them to sample multiple candidate programs, which are then executed and verified to identify a correct one. It translates each correct program into a language description of the reasoning steps, which are then distilled into a VLM. Extensive experiments show that VPD improves the VLM's ability to count, understand spatial relations, and reason compositionally. Our VPD-trained PaLI-X outperforms all prior VLMs, achieving state-of-the-art performance across complex vision tasks, including MMBench, OK-VQA, A-OKVQA, TallyQA, POPE, and Hateful Memes. An evaluation with human annotators also confirms that VPD improves model response factuality and consistency. Finally, experiments on content moderation demonstrate that VPD is also helpful for adaptation to real-world applications with limited data.
△ Less
Submitted 5 April, 2024; v1 submitted 5 December, 2023;
originally announced December 2023.
-
DeepBurning-MixQ: An Open Source Mixed-Precision Neural Network Accelerator Design Framework for FPGAs
Authors:
Erjing Luo,
Haitong Huang,
Cheng Liu,
Guoyu Li,
Bing Yang,
Ying Wang,
Huawei Li,
Xiaowei Li
Abstract:
Mixed-precision neural networks (MPNNs) that enable the use of just enough data width for a deep learning task promise significant advantages of both inference accuracy and computing overhead. FPGAs with fine-grained reconfiguration capability can adapt the processing with distinct data width and models, and hence, can theoretically unleash the potential of MPNNs. Nevertheless, commodity DPUs on F…
▽ More
Mixed-precision neural networks (MPNNs) that enable the use of just enough data width for a deep learning task promise significant advantages of both inference accuracy and computing overhead. FPGAs with fine-grained reconfiguration capability can adapt the processing with distinct data width and models, and hence, can theoretically unleash the potential of MPNNs. Nevertheless, commodity DPUs on FPGAs mostly emphasize generality and have limited support for MPNNs especially the ones with lower data width. In addition, primitive DSPs in FPGAs usually have much larger data width than that is required by MPNNs and haven't been sufficiently co-explored with MPNNs yet. To this end, we propose an open source MPNN accelerator design framework specifically tailored for FPGAs. In this framework, we have a systematic DSP-packing algorithm to pack multiple lower data width MACs in a single primitive DSP and enable efficient implementation of MPNNs. Meanwhile, we take DSP packing efficiency into consideration with MPNN quantization within a unified neural network architecture search (NAS) framework such that it can be aware of the DSP overhead during quantization and optimize the MPNN performance and accuracy concurrently. Finally, we have the optimized MPNN fine-tuned to a fully pipelined neural network accelerator template based on HLS and make best use of available resources for higher performance. Our experiments reveal the resulting accelerators produced by the proposed framework can achieve overwhelming advantages in terms of performance, resource utilization, and inference accuracy for MPNNs when compared with both handcrafted counterparts and prior hardware-aware neural network accelerators on FPGAs.
△ Less
Submitted 22 August, 2023;
originally announced August 2023.
-
On the Difficulty of Intersection Checking with Polynomial Zonotopes
Authors:
Yushen Huang,
Ertai Luo,
Stanley Bak,
Yifan Sun
Abstract:
Polynomial zonotopes, a non-convex set representation, have a wide range of applications from real-time motion planning and control in robotics, to reachability analysis of nonlinear systems and safety shielding in reinforcement learning. Despite this widespread use, a frequently overlooked difficulty associated with polynomial zonotopes is intersection checking. Determining whether the reachable…
▽ More
Polynomial zonotopes, a non-convex set representation, have a wide range of applications from real-time motion planning and control in robotics, to reachability analysis of nonlinear systems and safety shielding in reinforcement learning. Despite this widespread use, a frequently overlooked difficulty associated with polynomial zonotopes is intersection checking. Determining whether the reachable set, represented as a polynomial zonotope, intersects an unsafe set is not straightforward. In fact, we show that this fundamental operation is NP-hard, even for a simple class of polynomial zonotopes. The standard method for intersection checking with polynomial zonotopes is a two-part algorithm that overapproximates a polynomial zonotope with a regular zonotope and then, if the overapproximation error is deemed too large, splits the set and recursively tries again. Beyond the possible need for a large number of splits, we identify two sources of concern related to this algorithm: (1) overapproximating a polynomial zonotope with a zonotope has unbounded error, and (2) after splitting a polynomial zonotope, the overapproximation error can actually increase. Taken together, this implies there may be a possibility that the algorithm does not always terminate.We perform a rigorous analysis of the method and detail necessary conditions for the union of overapproximations to provably converge to the original polynomial zonotope.
△ Less
Submitted 17 May, 2023; v1 submitted 16 May, 2023;
originally announced May 2023.
-
Towards Understanding the Effect of Pretraining Label Granularity
Authors:
Guan Zhe Hong,
Yin Cui,
Ariel Fuxman,
Stanley H. Chan,
Enming Luo
Abstract:
In this paper, we study how the granularity of pretraining labels affects the generalization of deep neural networks in image classification tasks. We focus on the "fine-to-coarse" transfer learning setting, where the pretraining label space is more fine-grained than that of the target problem. Empirically, we show that pretraining on the leaf labels of ImageNet21k produces better transfer results…
▽ More
In this paper, we study how the granularity of pretraining labels affects the generalization of deep neural networks in image classification tasks. We focus on the "fine-to-coarse" transfer learning setting, where the pretraining label space is more fine-grained than that of the target problem. Empirically, we show that pretraining on the leaf labels of ImageNet21k produces better transfer results on ImageNet1k than pretraining on other coarser granularity levels, which supports the common practice used in the community. Theoretically, we explain the benefit of fine-grained pretraining by proving that, for a data distribution satisfying certain hierarchy conditions, 1) coarse-grained pretraining only allows a neural network to learn the "common" or "easy-to-learn" features well, while 2) fine-grained pretraining helps the network learn the "rarer" or "fine-grained" features in addition to the common ones, thus improving its accuracy on hard downstream test samples in which common features are missing or weak in strength. Furthermore, we perform comprehensive experiments using the label hierarchies of iNaturalist 2021 and observe that the following conditions, in addition to proper choice of label granularity, enable the transfer to work well in practice: 1) the pretraining dataset needs to have a meaningful label hierarchy, and 2) the pretraining and target label functions need to align well.
△ Less
Submitted 5 October, 2023; v1 submitted 29 March, 2023;
originally announced March 2023.
-
Agile Modeling: From Concept to Classifier in Minutes
Authors:
Otilia Stretcu,
Edward Vendrow,
Kenji Hata,
Krishnamurthy Viswanathan,
Vittorio Ferrari,
Sasan Tavakkol,
Wenlei Zhou,
Aditya Avinash,
Enming Luo,
Neil Gordon Alldrin,
MohammadHossein Bateni,
Gabriel Berger,
Andrew Bunner,
Chun-Ta Lu,
Javier A Rey,
Giulia DeSalvo,
Ranjay Krishna,
Ariel Fuxman
Abstract:
The application of computer vision to nuanced subjective use cases is growing. While crowdsourcing has served the vision community well for most objective tasks (such as labeling a "zebra"), it now falters on tasks where there is substantial subjectivity in the concept (such as identifying "gourmet tuna"). However, empowering any user to develop a classifier for their concept is technically diffic…
▽ More
The application of computer vision to nuanced subjective use cases is growing. While crowdsourcing has served the vision community well for most objective tasks (such as labeling a "zebra"), it now falters on tasks where there is substantial subjectivity in the concept (such as identifying "gourmet tuna"). However, empowering any user to develop a classifier for their concept is technically difficult: users are neither machine learning experts, nor have the patience to label thousands of examples. In reaction, we introduce the problem of Agile Modeling: the process of turning any subjective visual concept into a computer vision model through a real-time user-in-the-loop interactions. We instantiate an Agile Modeling prototype for image classification and show through a user study (N=14) that users can create classifiers with minimal effort under 30 minutes. We compare this user driven process with the traditional crowdsourcing paradigm and find that the crowd's notion often differs from that of the user's, especially as the concepts become more subjective. Finally, we scale our experiments with simulations of users training classifiers for ImageNet21k categories to further demonstrate the efficacy.
△ Less
Submitted 12 May, 2023; v1 submitted 24 February, 2023;
originally announced February 2023.
-
Pre-Distribution of Entanglements in Quantum Networks
Authors:
Mohammad Ghaderibaneh,
Himanshu Gupta,
C. R. Ramakrishnan,
Ertai Luo
Abstract:
Quantum network communication is challenging, as the No-Cloning theorem in quantum regime makes many classical techniques inapplicable. For long-distance communication, the only viable approach is teleportation of quantum states, which requires a prior distribution of entangled pairs (EPs) of qubits. Establishment of EPs across remote nodes can incur significant latency due to the low probability…
▽ More
Quantum network communication is challenging, as the No-Cloning theorem in quantum regime makes many classical techniques inapplicable. For long-distance communication, the only viable approach is teleportation of quantum states, which requires a prior distribution of entangled pairs (EPs) of qubits. Establishment of EPs across remote nodes can incur significant latency due to the low probability of success of the underlying physical processes. To reduce EP generation latency, prior works have looked at selection of efficient entanglement-routing paths and simultaneous use of multiple such paths for EP generation. In this paper, we propose and investigate a complementary technique to reduce EP generation latency--to pre-distribute EPs over certain (pre-determined) pairs of network nodes; these pre-distributed EPs can then be used to generate EPs for the requested pairs, when needed, with lower generation latency. For such an pre-distribution approach to be most effective, we need to address an optimization problem of selection of node-pairs where the EPs should be pre-distributed to minimize the generation latency of expected EP requests, under a given cost constraint. In this paper, we appropriately formulate the above optimization problem and design two efficient algorithms, one of which is a greedy approach based on an approximation algorithm for a special case. Via extensive evaluations over the NetSquid simulator, we demonstrate the effectiveness of our approach and developed techniques; we show that our developed algorithms outperform a naive approach by up to an order of magnitude.
△ Less
Submitted 9 May, 2022;
originally announced May 2022.
-
NoiseRank: Unsupervised Label Noise Reduction with Dependence Models
Authors:
Karishma Sharma,
Pinar Donmez,
Enming Luo,
Yan Liu,
I. Zeki Yalniz
Abstract:
Label noise is increasingly prevalent in datasets acquired from noisy channels. Existing approaches that detect and remove label noise generally rely on some form of supervision, which is not scalable and error-prone. In this paper, we propose NoiseRank, for unsupervised label noise reduction using Markov Random Fields (MRF). We construct a dependence model to estimate the posterior probability of…
▽ More
Label noise is increasingly prevalent in datasets acquired from noisy channels. Existing approaches that detect and remove label noise generally rely on some form of supervision, which is not scalable and error-prone. In this paper, we propose NoiseRank, for unsupervised label noise reduction using Markov Random Fields (MRF). We construct a dependence model to estimate the posterior probability of an instance being incorrectly labeled given the dataset, and rank instances based on their estimated probabilities. Our method 1) Does not require supervision from ground-truth labels, or priors on label or noise distribution. 2) It is interpretable by design, enabling transparency in label noise removal. 3) It is agnostic to classifier architecture/optimization framework and content modality. These advantages enable wide applicability in real noise settings, unlike prior works constrained by one or more conditions. NoiseRank improves state-of-the-art classification on Food101-N (~20% noise), and is effective on high noise Clothing-1M (~40% noise).
△ Less
Submitted 14 March, 2020;
originally announced March 2020.
-
Active suppression of temperature oscillation from a pulse-tube cryocooler in a cryogen-free cryostat: Part 2. Experimental realization
Authors:
Changzhao Pan,
Jiangfeng Hu,
Haiyang Zhang,
Yaonan Song,
Dongxu Han,
Wenjing Liu,
Hui Chen,
Mark Plimmer,
Fernando Sparasci,
Ercang Luo,
Bo Gao,
Laurent Pitre
Abstract:
A cryogen-free cryostat cooled by a closed cycle cryocooler is compact, can provide uninterrupted long-term operation (up to ten thousand hours) and is suited to temperatures from 3 K to 300 K. Its intrinsic temperature oscillation, however, limits its application in experiments requiring high thermal stability at low temperature (below 77 K). Passive suppression methods are effective but all suff…
▽ More
A cryogen-free cryostat cooled by a closed cycle cryocooler is compact, can provide uninterrupted long-term operation (up to ten thousand hours) and is suited to temperatures from 3 K to 300 K. Its intrinsic temperature oscillation, however, limits its application in experiments requiring high thermal stability at low temperature (below 77 K). Passive suppression methods are effective but all suffer from drawbacks. We describe a novel, active suppression scheme more efficient than traditional proportional-integral (PI) control. The experimental results show that it can reduce the standard deviation of the temperature oscillation by a further 30% compared with PI feedback. To the best of our knowledge, this is the first time such active suppression of temperature oscillations has been implemented with the cryogen-free cryostat. The results also show, however, that an unwanted lower frequency thermal noise will be generated, which appears to be the limit of the method. Nevertheless, the approach could be used to improve the temperature stability in all cryogen-free cryostats.
△ Less
Submitted 8 February, 2020;
originally announced February 2020.
-
Active suppression of temperature oscillation from a pulse-tube cryocooler in a cryogen-free cryostat: Part 1. Simulation modeling from thermal response characteristics
Authors:
Changzhao Pan,
Bo Gao,
Yaonan Song,
Haiyang Zhang,
Dongxu Han,
Jiangfeng Hu,
Wenjing Liu,
Hui Chen,
Mark Plimmer,
Fernando Sparasci,
Ercang Luo,
Laurent Pitre
Abstract:
A cryogen-free cryostat cooled using a 4 K commercial GM or pulse tube cryocooler (PTC) displays temperature oscillations caused by the intrinsic working principle of the regenerative cryocooler. To dampen such oscillations usually requires either a large heat capacity or a large thermal resistance. To understand this phenomenon better and suppress it more effectively, both the step response chara…
▽ More
A cryogen-free cryostat cooled using a 4 K commercial GM or pulse tube cryocooler (PTC) displays temperature oscillations caused by the intrinsic working principle of the regenerative cryocooler. To dampen such oscillations usually requires either a large heat capacity or a large thermal resistance. To understand this phenomenon better and suppress it more effectively, both the step response characteristic and the intrinsic oscillation characteristic of cryostat have been used to obtain the complete transfer functions of a simulation model. The latter is used to test and optimize traditional PID feedback control. The results showed this approach has almost no effect on the temperature oscillation amplitude. Based on this simulation model, a novel active method was proposed and tested numerically. Simulation results predict the method should suppress the amplitude of the original temperature oscillation by a factor of two.
△ Less
Submitted 8 February, 2020;
originally announced February 2020.
-
Realization of ppm level pressure stability for primary thermometry using a primary piston gauge
Authors:
Bo Gao,
Hui Chen,
Dongxu Han,
Pascal Gambette,
Haiyang Zhang,
Changzhao Pan,
Yingwen Liu,
Bo Yu,
Ercang Luo,
Mark Plimmer,
Laurent Pitre
Abstract:
To achieve an uncertainty of 0.25 mK in single-pressure refractive-index gas thermometry (SPRIGT), the relative pressure variation of He-4 gas in the range 30 kPa to 90 kPa, should not exceed 4 ppm (k=1). To this end, a novel pressure control system has been developed. It consists of two main parts: a piston gauge to control the pressure, and a home-made gas compensation system to supplement the m…
▽ More
To achieve an uncertainty of 0.25 mK in single-pressure refractive-index gas thermometry (SPRIGT), the relative pressure variation of He-4 gas in the range 30 kPa to 90 kPa, should not exceed 4 ppm (k=1). To this end, a novel pressure control system has been developed. It consists of two main parts: a piston gauge to control the pressure, and a home-made gas compensation system to supplement the micro-leak of the piston gauge. In addition, to maintain the piston at constant height, a servo loop is used that automatically determines in real time the amount of extra gas required. At room temperature, the standard deviations of the stabilized pressure are 3.0 mPa at 30 kPa, 4.5 mPa at 60 kPa and 2 mPa at 90 kPa. For the temperature region 5 K-25 K used for SPRIGT in the present work, the relative pressure stability is better than 0.16 ppm i.e. 25 times better than required. Moreover, the same pressure stabilization system is readily transposable to other primary gas thermometers.
△ Less
Submitted 8 February, 2020;
originally announced February 2020.
-
Resonance frequency measurement with accuracy and stability at the 10-12 level in a copper microwave cavity below 26 K by experimental optimization
Authors:
Haiyang Zhang,
Bo Gao,
Wenjing Liu,
Changzhao Pan,
Dongxu Han,
Ercang Luo,
Laurent Pitre
Abstract:
Single pressure refractive index gas thermometry (SPRIGT) is a novel primary thermometry, jointly developed by TIPC of CAS in China and LNE-Cnam in France. To realize a competitive uncertainty of 0.25 mK for thermodynamic temperature measurements, high-stability and low-uncertainty of microwave resonance frequency measurements better than 2 ppb should be achieved. This article describes how to rea…
▽ More
Single pressure refractive index gas thermometry (SPRIGT) is a novel primary thermometry, jointly developed by TIPC of CAS in China and LNE-Cnam in France. To realize a competitive uncertainty of 0.25 mK for thermodynamic temperature measurements, high-stability and low-uncertainty of microwave resonance frequency measurements better than 2 ppb should be achieved. This article describes how to realize high-stability and low-uncertainty of resonance frequency measurements in a copper microwave cavity by experimental optimization methods based on Allan analysis of variance. In this manner, 10-12 level accuracy and stability of microwave resonance frequency measurements were realized with an integration time of 3 hours, which is nearly 20 times better than those without optimization in our previous work (Sci. Bull 2019; 64: 286-288). It has potential applications in gas metrology and other research fields, where high-stability and low-uncertainty microwave measurements are necessary. Besides, microwave measurements were carried out isobarically at pressures of (30, 60, 90, and 120) kPa over the temperature range of (5 to 26) K, with good microwave mode consistency for the determined thermodynamic temperatures. These will provide strong support for the success of the implementation of SPRIGT in China.
△ Less
Submitted 8 February, 2020;
originally announced February 2020.
-
Prediction and realization of a temperature control limit at low temperatures in SPRIGT
Authors:
Haiyang Zhang,
Bo Gao,
Yaonan Song,
Changzhao Pan,
Jiangfeng Hu,
Dongxu Han,
Ercang Luo,
Laurent Pitre
Abstract:
On May 20th 2019, the World Metrology Day, the Bureau International des Poids et Mesures announced a major revision to the four more SI units. The base unit, the kelvin, is defined by fixing the value of Boltzmann constant as indicated in Mise en pratique for the definition of the kelvin in the SI. To realize the new kelvin, a novel practical realization technique of single-pressure refractive-ind…
▽ More
On May 20th 2019, the World Metrology Day, the Bureau International des Poids et Mesures announced a major revision to the four more SI units. The base unit, the kelvin, is defined by fixing the value of Boltzmann constant as indicated in Mise en pratique for the definition of the kelvin in the SI. To realize the new kelvin, a novel practical realization technique of single-pressure refractive-index gas thermometry (SPRIGT) has been jointly developed by the TIPC-CAS in China and the LNE-Cnam in France. To carry out accurate SPRIGT, experimental methods have been implemented and micro-kelvin level temperature control limits have been predicted and achieved at 5 K to 26 K. The resonator temperature stability can be maintained to within better than 8 μK of its set point with an integration time 33.6 s over 180 h. Besides, solutions for further improving the stability were also demonstrated, which can be a reference for temperature metrology field worldwide and other fields where high-stability temperature is required. The present work should also provide a solid foundation for international data comparison of thermodynamic temperature at low temperatures, and will promote realizations of the new kelvin and the spread of high-accuracy, low-temperature metrology.
△ Less
Submitted 8 February, 2020;
originally announced February 2020.
-
Adaptive Image Denoising by Mixture Adaptation
Authors:
Enming Luo,
Stanley H. Chan,
Truong Q. Nguyen
Abstract:
We propose an adaptive learning procedure to learn patch-based image priors for image denoising. The new algorithm, called the Expectation-Maximization (EM) adaptation, takes a generic prior learned from a generic external database and adapts it to the noisy image to generate a specific prior. Different from existing methods that combine internal and external statistics in ad-hoc ways, the propose…
▽ More
We propose an adaptive learning procedure to learn patch-based image priors for image denoising. The new algorithm, called the Expectation-Maximization (EM) adaptation, takes a generic prior learned from a generic external database and adapts it to the noisy image to generate a specific prior. Different from existing methods that combine internal and external statistics in ad-hoc ways, the proposed algorithm is rigorously derived from a Bayesian hyper-prior perspective. There are two contributions of this paper: First, we provide full derivation of the EM adaptation algorithm and demonstrate methods to improve the computational complexity. Second, in the absence of the latent clean image, we show how EM adaptation can be modified based on pre-filtering. Experimental results show that the proposed adaptation algorithm yields consistently better denoising results than the one without adaptation and is superior to several state-of-the-art algorithms.
△ Less
Submitted 24 June, 2016; v1 submitted 18 January, 2016;
originally announced January 2016.
-
Adaptive Image Denoising by Targeted Databases
Authors:
Enming Luo,
Stanley H. Chan,
Truong Q. Nguyen
Abstract:
We propose a data-dependent denoising procedure to restore noisy images. Different from existing denoising algorithms which search for patches from either the noisy image or a generic database, the new algorithm finds patches from a database that contains only relevant patches. We formulate the denoising problem as an optimal filter design problem and make two contributions. First, we determine th…
▽ More
We propose a data-dependent denoising procedure to restore noisy images. Different from existing denoising algorithms which search for patches from either the noisy image or a generic database, the new algorithm finds patches from a database that contains only relevant patches. We formulate the denoising problem as an optimal filter design problem and make two contributions. First, we determine the basis function of the denoising filter by solving a group sparsity minimization problem. The optimization formulation generalizes existing denoising algorithms and offers systematic analysis of the performance. Improvement methods are proposed to enhance the patch search process. Second, we determine the spectral coefficients of the denoising filter by considering a localized Bayesian prior. The localized prior leverages the similarity of the targeted database, alleviates the intensive Bayesian computation, and links the new method to the classical linear minimum mean squared error estimation. We demonstrate applications of the proposed method in a variety of scenarios, including text images, multiview images and face images. Experimental results show the superiority of the new algorithm over existing methods.
△ Less
Submitted 3 November, 2014; v1 submitted 30 June, 2014;
originally announced July 2014.