Search | arXiv e-print repository

LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

Authors: LLM-jp: Akiko Aizawa, Eiji Aramaki, Bowen Chen, Fei Cheng, Hiroyuki Deguchi, Rintaro Enomoto, Kazuki Fujii, Kensuke Fukumoto, Takuya Fukushima, Namgi Han, Yuto Harada, Chikara Hashimoto, Tatsuya Hiraoka, Shohei Hisada, Sosuke Hosokawa, Lu Jie, Keisuke Kamata, Teruhito Kanazawa, Hiroki Kanezashi, Hiroshi Kataoka, Satoru Katsumata, Daisuke Kawahara, Seiya Kawano, et al. (57 additional authors not shown)

Abstract: This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its activities, and technical reports on the LLMs developed by LLM-jp. For the latest activities, visit https://llm-jp.nii.ac.jp/en/.

Submitted 4 July, 2024; originally announced July 2024.

arXiv:2406.17185 [pdf, other]

Vaporetto: Efficient Japanese Tokenization Based on Improved Pointwise Linear Classification

Authors: Koichi Akabe, Shunsuke Kanda, Yusuke Oda, Shinsuke Mori

Abstract: This paper proposes an approach to improve the runtime efficiency of Japanese tokenization based on the pointwise linear classification (PLC) framework, which formulates the whole tokenization process as a sequence of linear classification problems. Our approach optimizes tokenization by leveraging the characteristics of the PLC framework and the task definition. Our approach involves (1) composing multiple classifications into array-based operations, (2) efficient feature lookup with memory-optimized automata, and (3) three orthogonal pre-processing methods for reducing actual score calculation. Thus, our approach makes the tokenization speed 5.7 times faster than the current approach based on the same model without decreasing tokenization accuracy. Our implementation is available at https://github.com/daac-tools/vaporetto under the MIT or Apache-2.0 license.

Submitted 24 June, 2024; originally announced June 2024.

arXiv:2311.11690 [pdf, other]

doi 10.1109/APSEC60848.2023.00025

Refactoring Programs Using Large Language Models with Few-Shot Examples

Authors: Atsushi Shirafuji, Yusuke Oda, Jun Suzuki, Makoto Morishita, Yutaka Watanobe

Abstract: A less complex and more straightforward program is a crucial factor that enhances its maintainability and makes writing secure and bug-free programs easier. However, due to its heavy workload and the risks of breaking the working programs, programmers are reluctant to do code refactoring, and thus, it also causes the loss of potential learning experiences. To mitigate this, we demonstrate the application of using a large language model (LLM), GPT-3.5, to suggest less complex versions of the user-written Python program, aiming to encourage users to learn how to write better programs. We propose a method to leverage the prompting with few-shot examples of the LLM by selecting the best-suited code refactoring examples for each target programming problem based on the prior evaluation of prompting with the one-shot example. The quantitative evaluation shows that 95.68% of programs can be refactored by generating 10 candidates each, resulting in a 17.35% reduction in the average cyclomatic complexity and a 25.84% decrease in the average number of lines after filtering only generated programs that are semantically correct. Furthermore, the qualitative evaluation shows outstanding capability in code formatting, while unnecessary behaviors such as deleting or translating comments are also observed.

Submitted 20 November, 2023; originally announced November 2023.

Comments: 10 pages, 10 figures, accepted to the 30th Asia-Pacific Software Engineering Conference (APSEC 2023)

arXiv:2306.14583 [pdf, ps, other]

Exploring the Robustness of Large Language Models for Solving Programming Problems

Authors: Atsushi Shirafuji, Yutaka Watanobe, Takumi Ito, Makoto Morishita, Yuki Nakamura, Yusuke Oda, Jun Suzuki

Abstract: Using large language models (LLMs) for source code has recently gained attention. LLMs, such as Transformer-based models like Codex and ChatGPT, have been shown to be highly capable of solving a wide range of programming problems. However, the extent to which LLMs understand problem descriptions and generate programs accordingly or just retrieve source code from the most relevant problem in training data based on superficial cues has not been discovered yet. To explore this research question, we conduct experiments to understand the robustness of several popular LLMs, CodeGen and GPT-3.5 series models, capable of tackling code generation tasks in introductory programming problems. Our experimental results show that CodeGen and Codex are sensitive to the superficial modifications of problem descriptions and significantly impact code generation performance. Furthermore, we observe that Codex relies on variable names, as randomized variables decrease the solved rate significantly. However, the state-of-the-art (SOTA) models, such as InstructGPT and ChatGPT, show higher robustness to superficial modifications and have an outstanding capability for solving programming problems. This highlights the fact that slight modifications to the prompts given to the LLMs can greatly affect code generation performance, and careful formatting of prompts is essential for high-quality code generation, while the SOTA models are becoming more robust to perturbations.

Submitted 26 June, 2023; originally announced June 2023.

arXiv:2208.05978 [pdf, other]

doi 10.1103/PhysRevLett.131.210802

Quantum Crosstalk Robust Quantum Control

Authors: Zeyuan Zhou, Ryan Sitler, Yasuo Oda, Kevin Schultz, Gregory Quiroz

Abstract: The prevalence of quantum crosstalk in current quantum devices poses challenges for achieving high-fidelity quantum logic operations and reliable quantum processing. Through quantum control theory, we develop an analytical condition for achieving crosstalk-robust single-qubit control of multi-qubit systems. We examine the effects of quantum crosstalk via a cumulant expansion and develop a condition to suppress the leading order contributions to the dynamics. The efficacy of the condition is illustrated in the domains of quantum state preservation and noise characterization through the development of crosstalk-robust dynamical decoupling and quantum noise spectroscopy (QNS) protocols. Using the IBM Quantum Experience, crosstalk-robust state preservation is demonstrated on 27 qubits, where a $3\times$ improvement in coherence decay is observed for single-qubit product and multipartite entangled states. Through the use of noise injection, we experimentally demonstrate crosstalk-robust dephasing QNS on a seven qubit processor, where a $10^4$ improvement in reconstruction accuracy over "cross-susceptible" alternatives is found. Together, these experiments highlight the significant impact the crosstalk mitigation condition can have on improving multi-qubit characterization and control on current quantum devices.

Submitted 20 November, 2022; v1 submitted 11 August, 2022; originally announced August 2022.

Journal ref: Phys. Rev. Lett. 131, 210802 (2023)

arXiv:2207.13870 [pdf, other]

doi 10.1002/spe.3190

Engineering faster double-array Aho-Corasick automata

Authors: Shunsuke Kanda, Koichi Akabe, Yusuke Oda

Abstract: Multiple pattern matching in strings is a fundamental problem in text processing applications such as regular expressions or tokenization. This paper studies efficient implementations of double-array Aho-Corasick automata (DAACs), data structures for quickly performing the multiple pattern matching. The practical performance of DAACs is improved by carefully designing the data structure, and many implementation techniques have been proposed thus far. A problem in DAACs is that their ideas are not aggregated. Since comprehensive descriptions and experimental analyses are unavailable, engineers face difficulties in implementing an efficient DAAC. In this paper, we review implementation techniques for DAACs and provide a comprehensive description of them. We also propose several new techniques for further improvement. We conduct exhaustive experiments through real-world datasets and reveal the best combination of techniques to achieve a higher performance in DAACs. The best combination is different from those used in the most popular libraries of DAACs, which demonstrates that their performance can be further enhanced. On the basis of our experimental analysis, we developed a new Rust library for fast multiple pattern matching using DAACs, named Daachorse, as open-source software at https://github.com/daac-tools/daachorse. Experiments demonstrate that Daachorse outperforms other AC-automaton implementations, indicating its suitability as a fast alternative for multiple pattern matching in many applications.

Submitted 23 June, 2024; v1 submitted 27 July, 2022; originally announced July 2022.

Comments: Accepted by Software: Practice and Experience (Accepted version)

Journal ref: Software: Practice and Experience (SPE), 53(6): 1332-1361, 2023

arXiv:2206.03504 [pdf, other]

Optimally Band-Limited Noise Filtering for Single Qubit Gates

Authors: Yasuo Oda, Dennis Lucarelli, Kevin Schultz, B. David Clader, Gregory Quiroz

Abstract: We introduce a quantum control protocol that produces smooth, experimentally implementable control sequences optimized to combat temporally correlated noise for single qubit systems. The control ansatz is specifically chosen to be a functional expansion of discrete prolate spheroidal sequences, a discrete time basis known to be optimally concentrated in time and frequency, and quite attractive when faced with experimental control hardware constraints. We leverage the filter function formalism to transform the control problem into a filter design problem, and show that the frequency response of a quantum system can be carefully tailored to avoid the most relevant dynamical contributions of noise processes. Using gradient ascent, we obtain optimized filter functions and exploit them to elucidate important details about the relationship between filter function design, control bandwidth, and noise characteristics. In particular, we identify regimes of optimal noise suppression and in turn, optimal control bandwidth directly proportional to the size of the frequency bands where the noise power is large. In addition to providing guiding principles for filter design, our approach enables the development of controls that simultaneously yield robust noise filtering and high fidelity single qubit logic operations in a wide variety of complex noise environments.

Submitted 27 January, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

Comments: 26 pages, 12 figures

Journal ref: Phys. Rev. Applied 19, 014062 (2023)

arXiv:2205.09295 [pdf, other]

Are Prompt-based Models Clueless?

Authors: Pride Kavumba, Ryo Takahashi, Yusuke Oda

Abstract: Finetuning large pre-trained language models with a task-specific head has advanced the state-of-the-art on many natural language understanding benchmarks. However, models with a task-specific head require a lot of training data, making them susceptible to learning and exploiting dataset-specific superficial cues that do not generalize to other datasets. Prompting has reduced the data requirement by reusing the language model head and formatting the task input to match the pre-training objective. Therefore, it is expected that few-shot prompt-based models do not exploit superficial cues. This paper presents an empirical examination of whether few-shot prompt-based models also exploit superficial cues. Analyzing few-shot prompt-based models on MNLI, SNLI, HANS, and COPA has revealed that prompt-based models also exploit superficial cues. While the models perform well on instances with superficial cues, they often underperform or only marginally outperform random accuracy on instances without superficial cues.

Submitted 19 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

arXiv:2204.10894 [pdf, other]

doi 10.1103/PhysRevA.106.022425

Quantum Control Noise Spectroscopy with Optimal Suppression of Dephasing

Authors: Vivian Maloney, Yasuo Oda, Gregory Quiroz, B. David Clader, Leigh M. Norris

Abstract: We extend quantum noise spectroscopy (QNS) of amplitude control noise to settings where dephasing noise or detuning errors make significant contributions to qubit dynamics. Previous approaches to characterize amplitude noise are limited by their vulnerability to low-frequency dephasing noise and static detuning errors, which can overwhelm the target control noise signal and introduce bias into estimates of the amplitude noise spectrum. To overcome this problem, we leverage optimal control to identify a family of amplitude control waveforms that optimally suppress low-frequency dephasing noise and detuning errors, while maintaining the spectral concentration in the amplitude filter essential for spectral estimation. The waveforms found via numerical optimization have surprisingly simple analytic forms, consisting of oscillating sine waves obeying particular amplitude and frequency constraints. In numerically simulated QNS experiments, these waveforms demonstrate superior robustness, enabling accurate estimation of the amplitude noise spectrum in regimes where existing approaches are biased by low-frequency dephasing noise and detuning errors.

Submitted 5 May, 2023; v1 submitted 22 April, 2022; originally announced April 2022.

Comments: 14 pages + appendices, 7 figures

arXiv:2106.11798 [pdf, ps, other]

doi 10.1093/logcom/exad068

The failure of cut-elimination in cyclic proof for first-order logic with inductive definitions

Authors: Yukihiro Oda, James Brotherston, Makoto Tatsuta

Abstract: A cyclic proof system is a proof system whose proof figure is a tree with cycles. The cut-elimination in a proof system is fundamental. It is conjectured that the cut-elimination in the cyclic proof system for first-order logic with inductive definitions does not hold. This paper shows that the conjecture is correct by giving a sequent not provable without the cut rule but provable in the cyclic proof system.

Submitted 14 February, 2024; v1 submitted 22 June, 2021; originally announced June 2021.

Comments: 18 pages

Journal ref: Journal of Logic and Computation, 2023, exad068

arXiv:1910.13299 [pdf, other]

Findings of the Third Workshop on Neural Generation and Translation

Authors: Hiroaki Hayashi, Yusuke Oda, Alexandra Birch, Ioannis Konstas, Andrew Finch, Minh-Thang Luong, Graham Neubig, Katsuhito Sudoh

Abstract: This document describes the findings of the Third Workshop on Neural Generation and Translation, held in concert with the annual conference of the Empirical Methods in Natural Language Processing (EMNLP 2019). First, we summarize the research trends of papers presented in the proceedings. Second, we describe the results of the two shared tasks 1) efficient neural machine translation (NMT) where participants were tasked with creating NMT systems that are both accurate and efficient, and 2) document-level generation and translation (DGT) where participants were tasked with developing systems that generate summaries from structured data, potentially with assistance from text in another language.

Submitted 29 October, 2019; v1 submitted 29 October, 2019; originally announced October 2019.

Comments: Fixed the metadata (author list)

arXiv:1806.02940 [pdf, other]

Findings of the Second Workshop on Neural Machine Translation and Generation

Authors: Alexandra Birch, Andrew Finch, Minh-Thang Luong, Graham Neubig, Yusuke Oda

Abstract: This document describes the findings of the Second Workshop on Neural Machine Translation and Generation, held in concert with the annual conference of the Association for Computational Linguistics (ACL 2018). First, we summarize the research trends of papers presented in the proceedings, and note that there is particular interest in linguistic structure, domain adaptation, data augmentation, handling inadequate resources, and analysis of models. Second, we describe the results of the workshop's shared task on efficient neural machine translation, where participants were tasked with creating MT systems that are both accurate and efficient.

Submitted 18 June, 2018; v1 submitted 7 June, 2018; originally announced June 2018.

Comments: WNMT 2018

arXiv:1712.00148 [pdf, ps, other]

doi 10.1093/ptep/ptx180

Construction of KAGRA: an Underground Gravitational Wave Observatory

Authors: T. Akutsu, M. Ando, S. Araki, A. Araya, T. Arima, N. Aritomi, H. Asada, Y. Aso, S. Atsuta, K. Awai, L. Baiotti, M. A. Barton, D. Chen, K. Cho, K. Craig, R. DeSalvo, K. Doi, K. Eda, Y. Enomoto, R. Flaminio, S. Fujibayashi, Y. Fujii, M. -K. Fujimoto, M. Fukushima, T. Furuhata, et al. (202 additional authors not shown)

Abstract: Major construction and initial-phase operation of a second-generation gravitational-wave detector KAGRA has been completed. The entire 3-km detector is installed underground in a mine in order to be isolated from background seismic vibrations on the surface. This allows us to achieve a good sensitivity at low frequencies and high stability of the detector. Bare-bones equipment for the interferometer operation has been installed and the first test run was accomplished in March and April of 2016 with a rather simple configuration. The initial configuration of KAGRA is named iKAGRA. In this paper, we summarize the construction of KAGRA, including the study of the advantages and challenges of building an underground detector and the operation of the iKAGRA interferometer together with the geophysics interferometer that has been constructed in the same tunnel.

Submitted 11 December, 2017; v1 submitted 30 November, 2017; originally announced December 2017.

Comments: Resolution of some figures has been decreased from its original version submitted to a journal

Journal ref: Progress of Theoretical and Experimental Physics, Vol 2018, 1, 013F01

arXiv:1706.05765 [pdf, other]

An Empirical Study of Mini-Batch Creation Strategies for Neural Machine Translation

Authors: Makoto Morishita, Yusuke Oda, Graham Neubig, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura

Abstract: Training of neural machine translation (NMT) models usually uses mini-batches for efficiency purposes. During the mini-batched training process, it is necessary to pad shorter sentences in a mini-batch to be equal in length to the longest sentence therein for efficient computation. Previous work has noted that sorting the corpus based on the sentence length before making mini-batches reduces the amount of padding and increases the processing speed. However, despite the fact that mini-batch creation is an essential step in NMT training, widely used NMT toolkits implement disparate strategies for doing so, which have not been empirically validated or compared. This work investigates mini-batch creation strategies with experiments over two different datasets. Our results suggest that the choice of a mini-batch creation strategy has a large effect on NMT training and some length-based sorting strategies do not always work well compared with simple shuffling.

Submitted 18 June, 2017; originally announced June 2017.

Comments: 8 pages, accepted to the First Workshop on Neural Machine Translation

arXiv:1704.06918 [pdf, ps, other]

Neural Machine Translation via Binary Code Prediction

Authors: Yusuke Oda, Philip Arthur, Graham Neubig, Koichiro Yoshino, Satoshi Nakamura

Abstract: In this paper, we propose a new method for calculating the output layer in neural machine translation systems. The method is based on predicting a binary code for each word and can reduce computation time/memory requirements of the output layer to be logarithmic in vocabulary size in the best case. In addition, we also introduce two advanced approaches to improve the robustness of the proposed model: using error-correcting codes and combining softmax and binary codes. Experiments on two English-Japanese bidirectional translation tasks show proposed models achieve BLEU scores that approach the softmax, while reducing memory usage to the order of less than 1/10 and improving decoding speed on CPUs by x5 to x10.

Submitted 23 April, 2017; originally announced April 2017.

Comments: Accepted as a long paper at ACL2017

arXiv:1701.03980 [pdf, other]

DyNet: The Dynamic Neural Network Toolkit

Authors: Graham Neubig, Chris Dyer, Yoav Goldberg, Austin Matthews, Waleed Ammar, Antonios Anastasopoulos, Miguel Ballesteros, David Chiang, Daniel Clothiaux, Trevor Cohn, Kevin Duh, Manaal Faruqui, Cynthia Gan, Dan Garrette, Yangfeng Ji, Lingpeng Kong, Adhiguna Kuncoro, Gaurav Kumar, Chaitanya Malaviya, Paul Michel, Yusuke Oda, Matthew Richardson, Naomi Saphra, Swabha Swayamdipta, Pengcheng Yin

Abstract: We describe DyNet, a toolkit for implementing neural network models based on dynamic declaration of network structure. In the static declaration strategy that is used in toolkits like Theano, CNTK, and TensorFlow, the user first defines a computation graph (a symbolic representation of the computation), and then examples are fed into an engine that executes this computation and computes its derivatives. In DyNet's dynamic declaration strategy, computation graph construction is mostly transparent, being implicitly constructed by executing procedural code that computes the network outputs, and the user is free to use different network structures for each input. Dynamic declaration thus facilitates the implementation of more complicated network architectures, and DyNet is specifically designed to allow users to implement their models in a way that is idiomatic in their preferred programming language (C++ or Python). One challenge with dynamic declaration is that because the symbolic computation graph is defined anew for every training example, its construction must have low overhead. To achieve this, DyNet has an optimized C++ backend and lightweight graph representation. Experiments show that DyNet's speeds are faster than or comparable with static declaration toolkits, and significantly faster than Chainer, another dynamic declaration toolkit. DyNet is released open-source under the Apache 2.0 license and available at http://github.com/clab/dynet.

Submitted 14 January, 2017; originally announced January 2017.

Comments: 33 pages

arXiv:0901.2382 [pdf, ps, other]

doi 10.1088/0953-8984/21/7/075703

Point-contact spectroscopy of the heavy-fermion superconductor CePt$_{3}$Si

Authors: R. Onuki, A. Sumiyama, Y. Oda, T. Yasuda, R. Settai, Y. Onuki

Abstract: Differential resistance spectra (${\rm d}V/{\rm d}I-V$ characteristics) have been measured for point-contacts between the heavy-fermion superconductor (HFS) CePt$_{3}$Si and a normal metal. Some contacts show a peak at V=0 that is characteristic of HFS coexisting with a magnetic order such as UPd$_2$Al$_3$, UNi$_2$Al$_3$ and URu$_2$Si$_2$. The evolution of the peak occurs well above the antiferromagnetic transition temperature $T_{\rm N}\sim$ 2.2 K, so that the direct relationship with the magnetic transition is questionable. The half-width of the peak seems to reflect the crystal field splitting or the spin-wave gap as observed for the above-mentioned HFSs, possibly suggesting that some common scattering process induces the zero-bias peaks in these materials.

Submitted 15 January, 2009; originally announced January 2009.

Comments: 9 pages, 5 figures, To be published in J. Phys.: Condens. Matter

arXiv:0811.4011 [pdf, ps, other]

doi 10.1143/JPSJ.77.123710

Electrical Resistivity and Thermal Expansion Measurements of URu2Si2 under Pressure

Authors: Gaku Motoyama, Nobuyuki Yokoyama, Akihiko Sumiyama, Yasukage Oda

Abstract: We carried out simultaneous measurements of electrical resistivity and thermal expansion of the heavy-fermion compound URu2Si2 under pressure using a single crystal. We observed a phase transition anomaly between hidden (HO) and antiferromagnetic (AFM) ordered states at TM in the temperature dependence of both measurements. For the electrical resistivity, the anomaly at TM was very small compared with the distinct hump anomaly at the phase transition temperature T0 between the paramagnetic state (PM) and HO, and exhibited only a slight increase and decrease for the I // a-axis and c-axis, respectively. We estimated each excitation gap of HO, Delta_HO, and AFM, Delta_AFM, from the temperature dependence of electrical resistivity; Delta_HO and Delta_AFM have different pressure dependences from each other. On the other hand, the temperature dependence of thermal expansion exhibited a small anomaly at T0 and a large anomaly at TM. The pressure dependence of the phase boundaries of T0 and TM indicates that there is no critical end point and the two phase boundaries meet at the critical point.

Submitted 24 November, 2008; originally announced November 2008.

Comments: 4 pages, 4 figures

arXiv:0802.2759 [pdf, ps, other]

doi 10.1143/JPSJ.77.044710

Specific Heat Study of Magnetic and Superconducting Transitions in CePt3Si

Authors: Gaku Motoyama, Katsuhiro Maeda, Yasukage Oda

Abstract: Measurements of specific heat between 80 mK to 4 K and electrical resistivity between 80 mK to 10 K were carried out for polycrystalline CePt3Si samples cut into small pieces (typically $\sim$10 mg). In the specific heat measurements, we observed an antiferromagnetic transition jump at TN = 2.2 K for all the samples, while the heights have large variations. As regards superconductivity, we observed two distinct transition jumps at Tcl $\sim$ 0.45 K and Tch $\sim$ 0.75 K, which were the same for all the samples. From the measurements of specific heat and resistivity, systematic relations were found between antiferromagnetic and superconducting transitions. We conclude that antiferromagnetism, whose transition temperature is 2.2 K, coexists with superconductivity, whose transition temperature is Tcl. In this sample, residual electronic specific heat coefficient in the superconducting state $\gamma_{\rm s}$ was quite small, and specific heat divided by temperature below Tcl decreased almost linearly with decreasing temperature. In order to reveal the characteristic properties of the magnetism and superconductivity of the CePt3Si system, it is important to study the two superconducting phases with Tcl and Tch, respectively.

Submitted 20 February, 2008; originally announced February 2008.

Comments: 5 pages, 4 figures

Journal ref: J. Phys. Soc. Jpn. Vol.77 (2008)

arXiv:0709.0834 [pdf, ps, other]

doi 10.1143/JPSJ.76.114708

AC/DC Susceptibility of the Heavy-Fermion Superconductor CePt3Si under Pressure

Authors: Yoshihiro Aoki, Akihiko Sumiyama, Gaku Motoyama, Yasukage Oda, Yasuda Settai, Yoshichika Onuki

Abstract: We have investigated the pressure dependence of ac and dc susceptibilities of the heavy-fermion superconductor CePt3Si (Tc = 0.75 K) that coexists with antiferromagnetism (TN = 2.2 K). As hydrostatic pressure is increased, Tc first decreases rapidly, then rather slowly near the critical pressure Pc = 0.6 GPa and shows a stronger decrease again at higher pressures, where Pc is the pressure at which TN becomes zero. A transition width and a difference in the two transition temperatures defined in the form of structures in the out-of-phase component of ac susceptibilities also become small near Pc, indicating that a double transition observed in CePt3Si is caused by some inhomogeneous property in the sample that leads to a spatial variation of local pressure. A sudden increase in the Meissner fraction above Pc suggests the influence of antiferromagnetism on superconductivity.

Submitted 6 September, 2007; originally announced September 2007.

Comments: 4 pages with 5 figures. This paper will be published in J. Phys. Soc. Jpn

arXiv:cond-mat/0409638 [pdf, ps, other]

doi 10.1016/j.physb.2005.01.065

NMR study of electronic state in CePt3Si

Authors: K. Ueda, K. Hamamoto, T. Kohara, G. Motoyama, Y. Oda

Abstract: In this article, we report the temperature dependence of spin-lattice relaxation rates at two Pt sites and one Si site in CePt3Si with a non-centrosymmetric structure center. 1/T1 for both Pt sites between 2 K and 300 K and 1/T1 of Si above 3 K might be explained by the contributions from the low-lying crystal-electric-field level and the quasiparticle due to the hybridization between the ground state and conduction electrons. Just below Tc no remarkable enhancement in 1/T1 was observed. The estimated value of superconducting gap is about 2Delta = 3kBTc.

Submitted 24 September, 2004; originally announced September 2004.

Comments: 2 pages with 2 EPS figures. uses phb-proc4-auth.cls. Accepted for publication in Physica B

arXiv:cond-mat/0308528 [pdf, ps, other]

doi 10.1103/PhysRevB.68.132502

Proximity-induced superconductivity in platinum metals

Authors: D. Katayama, A. Sumiyama, Y. Oda

Abstract: The diamagnetism of platinum metals (N: Rh, Pt, Pd), which is induced by the proximity effect of a superconductor (S: Nb), has been investigated for N-S double layers. Notwithstanding the strong spin fluctuation in platinum metals, the screening distance ρ in N increases with a decrease in temperature and reaches a value which is expected in comparison with ρ in Cu. When magnetic impurities are included in N, the proximity effect is drastically suppressed and the paramagnetism due to a giant moment is observed.

Submitted 26 August, 2003; originally announced August 2003.

Comments: 4 pages, 4 figures, to be published in Phys. Rev. B

Journal ref: Phys. Rev. B 68, 132502 (2003)

arXiv:cond-mat/0011176 [pdf, ps, other]

doi 10.1143/JPSJ.70.228

Diamagnetic Response of Normal-Superconducting Double Layers

Authors: A. Sumiyama, T. Endo, Y. Nakagawa, Y. Oda

Abstract: The diamagnetism of a normal metal (N: Cu or Au), which is induced by the proximity effect of a superconductor (S: Nb), has been investigated for N-S double layers, which are formed by a thin-film deposition process. Detailed studies of samples, which have different electronic mean-free path $\ell_N$ in N, suggest that $\ell_N$ should be controlled by the impurity concentration rather than the mechanical imperfections in the lattice in order to clarify the $\ell_N$ dependence of the proximity effect. Both the screening distance $\rho$ in N and the parameter $\nu$ in $\rho \propto T^{-\nu}$ increase with an increase in $\ell_N$. This result can be understood on the assumption that the normal metal changes its behavior from the "dirty" limit ($\xi_N > \ell_N$) to the "clean" limit ($\xi_N < \ell_N$), where $\xi_N$ is the coherence length in N.

Submitted 10 November, 2000; originally announced November 2000.

Comments: 5 pages, 6 Postscript figures, to be published in J. Phys. Soc. Jpn

Journal ref: J. Phys. Soc. Jpn. 70 (2001) 228.

Showing 1–23 of 23 results for author: Oda, Y