Zum Hauptinhalt springen

Showing 1–38 of 38 results for author: Ou, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.15176  [pdf, other

    cs.SD cs.CL eess.AS

    Unlocking Potential in Pre-Trained Music Language Models for Versatile Multi-Track Music Arrangement

    Authors: Longshen Ou, Jingwei Zhao, Ziyu Wang, Gus Xia, Ye Wang

    Abstract: Large language models have shown significant capabilities across various domains, including symbolic music generation. However, leveraging these pre-trained models for controllable music arrangement tasks, each requiring different forms of musical information as control, remains a novel challenge. In this paper, we propose a unified sequence-to-sequence framework that enables the fine-tuning of a… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: Submitted to AAAI 2025

  2. arXiv:2408.04249  [pdf, other

    cs.CV

    InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting

    Authors: Xin-Yi Yu, Jun-Xin Yu, Li-Bo Zhou, Yan Wei, Lin-Lin Ou

    Abstract: We present InstantStyleGaussian, an innovative 3D style transfer method based on the 3D Gaussian Splatting (3DGS) scene representation. By inputting a target-style image, it quickly generates new 3D GS scenes. Our method operates on pre-reconstructed GS scenes, combining diffusion models with an improved iterative dataset update strategy. It utilizes diffusion models to generate target style image… ▽ More

    Submitted 26 August, 2024; v1 submitted 8 August, 2024; originally announced August 2024.

  3. arXiv:2408.00294  [pdf, other

    cs.CV cs.IR

    RDP: Ranked Differential Privacy for Facial Feature Protection in Multiscale Sparsified Subspace

    Authors: Lu Ou, Shaolin Liao, Shihui Gao, Guandong Huang, Zheng Qi

    Abstract: With the widespread sharing of personal face images in applications' public databases, face recognition systems faces real threat of being breached by potential adversaries who are able to access users' face images and use them to intrude the face recognition systems. In this paper, we propose a novel privacy protection method in the multiscale sparsified feature subspaces to protect sensitive fac… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 13 pages, 6 figures

  4. arXiv:2403.01214  [pdf, other

    cs.CV

    Boosting Box-supervised Instance Segmentation with Pseudo Depth

    Authors: Xinyi Yu, Ling Yan, Pengtao Jiang, Hao Chen, Bo Li, Lin Yuanbo Wu, Linlin Ou

    Abstract: The realm of Weakly Supervised Instance Segmentation (WSIS) under box supervision has garnered substantial attention, showcasing remarkable advancements in recent years. However, the limitations of box supervision become apparent in its inability to furnish effective information for distinguishing foreground from background within the specified target box. This research addresses this challenge by… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  5. arXiv:2309.09739  [pdf, other

    cs.CV

    Improving Neural Indoor Surface Reconstruction with Mask-Guided Adaptive Consistency Constraints

    Authors: Xinyi Yu, Liqin Lu, Jintao Rong, Guangkai Xu, Linlin Ou

    Abstract: 3D scene reconstruction from 2D images has been a long-standing task. Instead of estimating per-frame depth maps and fusing them in 3D, recent research leverages the neural implicit surface as a unified representation for 3D reconstruction. Equipped with data-driven pre-trained geometric cues, these methods have demonstrated promising performance. However, inaccurate prior estimation, which is usu… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  6. arXiv:2307.08300  [pdf, other

    cs.CV cs.AI

    ShiftNAS: Improving One-shot NAS via Probability Shift

    Authors: Mingyang Zhang, Xinyi Yu, Haodong Zhao, Linlin Ou

    Abstract: One-shot Neural architecture search (One-shot NAS) has been proposed as a time-efficient approach to obtain optimal subnet architectures and weights under different complexity cases by training only once. However, the subnet performance obtained by weight sharing is often inferior to the performance achieved by retraining. In this paper, we investigate the performance gap and attribute it to the u… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: accepted by iccv 2023

  7. arXiv:2307.02146  [pdf, other

    cs.CL cs.SD eess.AS

    LOAF-M2L: Joint Learning of Wording and Formatting for Singable Melody-to-Lyric Generation

    Authors: Longshen Ou, Xichu Ma, Ye Wang

    Abstract: Despite previous efforts in melody-to-lyric generation research, there is still a significant compatibility gap between generated lyrics and melodies, negatively impacting the singability of the outputs. This paper bridges the singability gap with a novel approach to generating singable lyrics by jointly Learning wOrding And Formatting during Melody-to-Lyric training. After general-domain pretrain… ▽ More

    Submitted 19 July, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: An extension of our previous work arXiv:2305.16816 [cs.CL]

  8. arXiv:2306.02243  [pdf, other

    cs.CV

    Retrieval-Enhanced Visual Prompt Learning for Few-shot Classification

    Authors: Jintao Rong, Hao Chen, Tianxiao Chen, Linlin Ou, Xinyi Yu, Yifan Liu

    Abstract: Prompt learning has become a popular approach for adapting large vision-language models, such as CLIP, to downstream tasks. Typically, prompt learning relies on a fixed prompt token or an input-conditional token to fit a small amount of data under full supervision. While this paradigm can generalize to a certain range of unseen classes, it may struggle when domain gap increases, such as in fine-gr… ▽ More

    Submitted 18 June, 2024; v1 submitted 3 June, 2023; originally announced June 2023.

  9. arXiv:2305.18403  [pdf, other

    cs.LG cs.CV

    LoRAPrune: Structured Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning

    Authors: Mingyang Zhang, Hao Chen, Chunhua Shen, Zhen Yang, Linlin Ou, Xinyi Yu, Bohan Zhuang

    Abstract: Large Language Models (LLMs), such as LLaMA and T5, have shown exceptional performance across various tasks through fine-tuning. Although low-rank adaption (LoRA) has emerged to cheaply fine-tune these LLMs on downstream tasks, their deployment is still hindered by the vast model scale and computational costs. Post-training model pruning offers a way to compress LLMs. However, the current pruning… ▽ More

    Submitted 6 August, 2024; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: accepted by acl 2024 findings

  10. arXiv:2305.17306  [pdf, other

    cs.CL cs.AI cs.LG

    Chain-of-Thought Hub: A Continuous Effort to Measure Large Language Models' Reasoning Performance

    Authors: Yao Fu, Litu Ou, Mingyu Chen, Yuhao Wan, Hao Peng, Tushar Khot

    Abstract: As large language models (LLMs) are continuously being developed, their evaluation becomes increasingly important yet challenging. This work proposes Chain-of-Thought Hub, an open-source evaluation suite on the multi-step reasoning capabilities of large language models. We are interested in this setting for two reasons: (1) from the behavior of GPT and PaLM model family, we observe that complex re… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Preprint. Code at https://github.com/FranxYao/chain-of-thought-hub

  11. arXiv:2305.16816  [pdf, other

    cs.CL

    Songs Across Borders: Singable and Controllable Neural Lyric Translation

    Authors: Longshen Ou, Xichu Ma, Min-Yen Kan, Ye Wang

    Abstract: The development of general-domain neural machine translation (NMT) methods has advanced significantly in recent years, but the lack of naturalness and musical constraints in the outputs makes them unable to produce singable lyric translations. This paper bridges the singability quality gap by formalizing lyric translation into a constrained translation problem, converting theoretical guidance and… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023. Camera-ready version

    MSC Class: 68T50

  12. arXiv:2304.12082  [pdf, other

    cs.SD eess.AS

    Deep Audio-Visual Singing Voice Transcription based on Self-Supervised Learning Models

    Authors: Xiangming Gu, Wei Zeng, Jianan Zhang, Longshen Ou, Ye Wang

    Abstract: Singing voice transcription converts recorded singing audio to musical notation. Sound contamination (such as accompaniment) and lack of annotated data make singing voice transcription an extremely difficult task. We take two approaches to tackle the above challenges: 1) introducing multimodal learning for singing voice transcription together with a new multimodal singing dataset, N20EMv2, enhanci… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

  13. arXiv:2304.09694  [pdf, other

    cs.CV

    CrossFusion: Interleaving Cross-modal Complementation for Noise-resistant 3D Object Detection

    Authors: Yang Yang, Weijie Ma, Hao Chen, Linlin Ou, Xinyi Yu

    Abstract: The combination of LiDAR and camera modalities is proven to be necessary and typical for 3D object detection according to recent studies. Existing fusion strategies tend to overly rely on the LiDAR modal in essence, which exploits the abundant semantics from the camera sensor insufficiently. However, existing methods cannot rely on information from other modalities because the corruption of LiDAR… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

  14. arXiv:2301.12726  [pdf, other

    cs.CL cs.AI cs.LG

    Specializing Smaller Language Models towards Multi-Step Reasoning

    Authors: Yao Fu, Hao Peng, Litu Ou, Ashish Sabharwal, Tushar Khot

    Abstract: The surprising ability of Large Language Models (LLMs) to perform well on complex reasoning with only few-shot chain-of-thought prompts is believed to emerge only in very large-scale models (100+ billion parameters). We show that such abilities can, in fact, be distilled down from GPT-3.5 ($\ge$ 175B) to T5 variants ($\le$ 11B). We propose model specialization, to specialize the model's ability to… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: Preprint

  15. arXiv:2207.09747  [pdf, other

    eess.AS cs.SD eess.SP

    Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription

    Authors: Longshen Ou, Xiangming Gu, Ye Wang

    Abstract: Automatic speech recognition (ASR) has progressed significantly in recent years due to the emergence of large-scale datasets and the self-supervised learning (SSL) paradigm. However, as its counterpart problem in the singing domain, the development of automatic lyric transcription (ALT) suffers from limited data and degraded intelligibility of sung lyrics. To fill in the performance gap between AL… ▽ More

    Submitted 16 October, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: Camera ready version of ISMIR 2022 submission

  16. arXiv:2207.06127  [pdf, other

    eess.AS cs.SD eess.SP

    MM-ALT: A Multimodal Automatic Lyric Transcription System

    Authors: Xiangming Gu, Longshen Ou, Danielle Ong, Ye Wang

    Abstract: Automatic lyric transcription (ALT) is a nascent field of study attracting increasing interest from both the speech and music information retrieval communities, given its significant application potential. However, ALT with audio data alone is a notoriously difficult task due to instrumental accompaniment and musical constraints resulting in degradation of both the phonetic cues and the intelligib… ▽ More

    Submitted 17 February, 2023; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted by ACM Multimedia 2022. Camera ready version and appendix

  17. arXiv:2206.15109  [pdf

    cs.CV

    MKIoU Loss: Towards Accurate Oriented Object Detection in Aerial Images

    Authors: Xinyi Yu, Jiangping Lu, Xinyi Yu, Mi Lin, Linlin Ou

    Abstract: Oriented bounding box regression is crucial for oriented object detection. However, regression-based methods often suffer from boundary problems and the inconsistency between loss and evaluation metrics. In this paper, a modulated Kalman IoU loss of approximate SkewIoU is proposed, named MKIoU. To avoid boundary problems, we convert the oriented bounding box to Gaussian distribution, then use the… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

  18. arXiv:2205.09830  [pdf, ps, other

    cs.CL

    Towards Understanding Gender-Seniority Compound Bias in Natural Language Generation

    Authors: Samhita Honnavalli, Aesha Parekh, Lily Ou, Sophie Groenwold, Sharon Levy, Vicente Ordonez, William Yang Wang

    Abstract: Women are often perceived as junior to their male counterparts, even within the same job titles. While there has been significant progress in the evaluation of gender bias in natural language processing (NLP), existing studies seldom investigate how biases toward gender groups change when compounded with other societal biases. In this work, we investigate how seniority impacts the degree of gender… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 6 pages, LREC 2022

  19. arXiv:2205.02003  [pdf, other

    cs.RO cs.AI

    Multi-subgoal Robot Navigation in Crowds with History Information and Interactions

    Authors: Xinyi Yu, Jianan Hu, Yuehai Fan, Wancai Zheng, Linlin Ou

    Abstract: Robot navigation in dynamic environments shared with humans is an important but challenging task, which suffers from performance deterioration as the crowd grows. In this paper, multi-subgoal robot navigation approach based on deep reinforcement learning is proposed, which can reason about more comprehensive relationships among all agents (robot and humans). Specifically, the next position point i… ▽ More

    Submitted 29 November, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

  20. arXiv:2204.06403  [pdf, other

    cs.AI

    Efficient Re-parameterization Operations Search for Easy-to-Deploy Network Based on Directional Evolutionary Strategy

    Authors: Xinyi Yu, Xiaowei Wang, Jintao Rong, Mingyang Zhang, Linlin Ou

    Abstract: Structural re-parameterization (Rep) methods has achieved significant performance improvement on traditional convolutional network. Most current Rep methods rely on prior knowledge to select the reparameterization operations. However, the performance of architecture is limited by the type of operations and prior knowledge. To break this restriction, in this work, an improved re-parameterization se… ▽ More

    Submitted 3 July, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: 21pages, 8figures

  21. arXiv:2204.03898  [pdf, other

    eess.AS cs.SD

    Exploring Transformer's potential on automatic piano transcription

    Authors: Longshen Ou, Ziyi Guo, Emmanouil Benetos, Jiqing Han, Ye Wang

    Abstract: Most recent research about automatic music transcription (AMT) uses convolutional neural networks and recurrent neural networks to model the mapping from music signals to symbolic notation. Based on a high-resolution piano transcription system, we explore the possibility of incorporating another powerful sequence transformation tool -- the Transformer -- to deal with the AMT problem. We argue that… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted by ICASSP 2022

    ACM Class: H.5.5

  22. Real-time Rail Recognition Based on 3D Point Clouds

    Authors: Xinyi Yu, Weiqi He, Xuecheng Qian, Yang Yang, Linlin Ou

    Abstract: Accurate rail location is a crucial part in the railway support driving system for safety monitoring. LiDAR can obtain point clouds that carry 3D information for the railway environment, especially in darkness and terrible weather conditions. In this paper, a real-time rail recognition method based on 3D point clouds is proposed to solve the challenges, such as disorderly, uneven density and large… ▽ More

    Submitted 7 January, 2022; originally announced January 2022.

  23. arXiv:2112.15358  [pdf, other

    cs.CV

    Conditional Generative Data-free Knowledge Distillation

    Authors: Xinyi Yu, Ling Yan, Yang Yang, Libo Zhou, Linlin Ou

    Abstract: Knowledge distillation has made remarkable achievements in model compression. However, most existing methods require the original training data, which is usually unavailable due to privacy and security issues. In this paper, we propose a conditional generative data-free knowledge distillation (CGDD) framework for training lightweight networks without any training data. This method realizes efficie… ▽ More

    Submitted 12 August, 2022; v1 submitted 31 December, 2021; originally announced December 2021.

  24. arXiv:2111.02283  [pdf, other

    cs.RO eess.SY

    A Self-adaptive LSAC-PID Approach based on Lyapunov Reward Shaping for Mobile Robots

    Authors: Xinyi Yu, Siyu Xu, Yuehai Fan, Linlin Ou

    Abstract: To solve the coupling problem of control loops and the adaptive parameter tuning problem in the multi-input multi-output (MIMO) PID control system, a self-adaptive LSAC-PID algorithm is proposed based on deep reinforcement learning (RL) and Lyapunov-based reward shaping in this paper. For complex and unknown mobile robot control environment, an RL-based MIMO PID hybrid control strategy is firstly… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 11 pages, 13 figures

  25. arXiv:2110.05842  [pdf, other

    cs.LG

    Across-Task Neural Architecture Search via Meta Learning

    Authors: Jingtao Rong, Xinyi Yu, Mingyang Zhang, Linlin Ou

    Abstract: Adequate labeled data and expensive compute resources are the prerequisites for the success of neural architecture search(NAS). It is challenging to apply NAS in meta-learning scenarios with limited compute resources and data. In this paper, an across-task neural architecture search (AT-NAS) is proposed to address the problem through combining gradient-based meta-learning with EA-based NAS to lear… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

  26. arXiv:2109.10187  [pdf

    cs.CV cs.AI

    Oriented Object Detection in Aerial Images Based on Area Ratio of Parallelogram

    Authors: Xinyi Yu, Mi Lin, Jiangping Lu, Linlin Ou

    Abstract: Oriented object detection is a challenging task in aerial images since the objects in aerial images are displayed in arbitrary directions and are frequently densely packed. The mainstream detectors describe rotating objects using a five-parament or eight-parament representations, which suffer from representation ambiguity for orientated object definition. In this paper, we propose a novel represen… ▽ More

    Submitted 8 November, 2021; v1 submitted 21 September, 2021; originally announced September 2021.

  27. arXiv:2109.03508  [pdf, other

    cs.LG cs.NE

    RepNAS: Searching for Efficient Re-parameterizing Blocks

    Authors: Mingyang Zhang, Xinyi Yu, Jingtao Rong, Linlin Ou

    Abstract: In the past years, significant improvements in the field of neural architecture search(NAS) have been made. However, it is still challenging to search for efficient networks due to the gap between the searched constraint and real inference time exists. To search for a high-performance network with low inference time, several previous works set a computational complexity constraint for the search a… ▽ More

    Submitted 14 June, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

  28. arXiv:2108.04123  [pdf, other

    cs.ET

    DP-DNA: A Digital Pattern-Aware DNA Storage System to Improve Encoding Density

    Authors: Bingzhe Li, Li Ou, David Du

    Abstract: With the rapid increase of available digital data, DNA storage is identified as a storage media with high density and capability of long-term preservation, especially for archival storage systems. However, the encoding density (i.e., how many binary bits can be encoded into one nucleotide) and error handling are two major factors intertwined in DNA storage. Considering encoding density, theoretica… ▽ More

    Submitted 24 August, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: 14 pages, 13 figures

  29. Pedestrian Attribute Recognition in Video Surveillance Scenarios Based on View-attribute Attention Localization

    Authors: Weichen Chen, Xinyi Yu, Linlin Ou

    Abstract: Pedestrian attribute recognition in surveillance scenarios is still a challenging task due to the inaccurate localization of specific attributes. In this paper, we propose a novel view-attribute localization method based on attention (VALA), which utilizes view information to guide the recognition process to focus on specific attributes and attention mechanism to localize specific attribute-corres… ▽ More

    Submitted 19 December, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Journal ref: Springer2022-Machine Intelligence Research

  30. A Self-adaptive SAC-PID Control Approach based on Reinforcement Learning for Mobile Robots

    Authors: Xinyi Yu, Yuehai Fan, Siyu Xu, Linlin Ou

    Abstract: Proportional-integral-derivative (PID) control is the most widely used in industrial control, robot control and other fields. However, traditional PID control is not competent when the system cannot be accurately modeled and the operating environment is variable in real time. To tackle these problems, we propose a self-adaptive model-free SAC-PID control approach based on reinforcement learning fo… ▽ More

    Submitted 19 March, 2021; originally announced March 2021.

    Comments: 20 oages, 12 figures

    Journal ref: Int J Robust Nolinear Control 31 (2021) 1-19

  31. IMG-DNA: Approximate DNA Storage for Images

    Authors: Bingzhe Li, Li Ou, David Du

    Abstract: Deoxyribonucleic Acid (DNA) as a storage medium with high density and long-term preservation properties can satisfy the requirement of archival storage for rapidly increased digital volume. The read and write processes of DNA storage are error-prone. Images widely used in social media have the properties of fault tolerance which are well fitted to the DNA storage. However, prior work simply invest… ▽ More

    Submitted 29 May, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: 11 pages, 12 figures

  32. arXiv:2011.04908  [pdf, other

    cs.CV

    Effective Model Compression via Stage-wise Pruning

    Authors: Mingyang Zhang, Xinyi Yu, Jingtao Rong, Linlin Ou

    Abstract: Automated Machine Learning(Auto-ML) pruning methods aim at searching a pruning strategy automatically to reduce the computational complexity of deep Convolutional Neural Networks(deep CNNs). However, some previous work found that the results of many Auto-ML pruning methods cannot even surpass the results of the uniformly pruning method. In this paper, the ineffectiveness of Auto-ML pruning which i… ▽ More

    Submitted 22 September, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

  33. arXiv:2010.08412  [pdf, other

    cs.CL cs.AR

    Vector-Vector-Matrix Architecture: A Novel Hardware-Aware Framework for Low-Latency Inference in NLP Applications

    Authors: Matthew Khoury, Rumen Dangovski, Longwu Ou, Preslav Nakov, Yichen Shen, Li Jing

    Abstract: Deep neural networks have become the standard approach to building reliable Natural Language Processing (NLP) applications, ranging from Neural Machine Translation (NMT) to dialogue systems. However, improving accuracy by increasing the model size requires a large number of hardware computations, which can slow down NLP applications significantly at inference time. To address this issue, we propos… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: To appear at the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP '20), November 16-20, 2020, NMT, AI accelerators, co-design, TPU, OPU, 10 pages, 3 figures, 4 tables

  34. arXiv:2010.02510  [pdf, other

    cs.CL cs.AI

    Investigating African-American Vernacular English in Transformer-Based Text Generation

    Authors: Sophie Groenwold, Lily Ou, Aesha Parekh, Samhita Honnavalli, Sharon Levy, Diba Mirza, William Yang Wang

    Abstract: The growth of social media has encouraged the written use of African American Vernacular English (AAVE), which has traditionally been used only in oral contexts. However, NLP models have historically been developed using dominant English varieties, such as Standard American English (SAE), due to text corpora availability. We investigate the performance of GPT-2 on AAVE text by creating a dataset o… ▽ More

    Submitted 29 October, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: 7 pages, EMNLP 2020

  35. arXiv:2007.13881  [pdf, other

    cs.CE physics.app-ph

    iESC: iterative Equivalent Surface Current Approximation

    Authors: Shaolin Liao, Lu Ou

    Abstract: A novel iterative Equivalent Surface Current (iESC) algorithm has been developed to simulate the electromagnetic scattering of electrically large dielectric objects with relatively smooth surfaces. The iESC algorithm corrects the surface currents to compensate for the electromagnetic field deviation across the dielectric surface. Numerically validation has been performed with a dielectric sphere t… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: 10 pages, 7 figures

  36. arXiv:2004.13939  [pdf, ps, other

    cs.CL

    Evaluating Transformer-Based Multilingual Text Classification

    Authors: Sophie Groenwold, Samhita Honnavalli, Lily Ou, Aesha Parekh, Sharon Levy, Diba Mirza, William Yang Wang

    Abstract: As NLP tools become ubiquitous in today's technological landscape, they are increasingly applied to languages with a variety of typological structures. However, NLP research does not focus primarily on typological differences in its analysis of state-of-the-art language models. As a result, NLP tools perform unequally across languages with different syntactic and morphological structures. Through… ▽ More

    Submitted 30 April, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: Total of 15 pages (9 pages for paper, 2 pages for references, 4 pages for appendix). Changed title

  37. arXiv:2003.01751  [pdf, other

    cs.LG stat.ML

    Automatic Hyper-Parameter Optimization Based on Mapping Discovery from Data to Hyper-Parameters

    Authors: Bozhou Chen, Kaixin Zhang, Longshen Ou, Chenmin Ba, Hongzhi Wang, Chunnan Wang

    Abstract: Machine learning algorithms have made remarkable achievements in the field of artificial intelligence. However, most machine learning algorithms are sensitive to the hyper-parameters. Manually optimizing the hyper-parameters is a common method of hyper-parameter tuning. However, it is costly and empirically dependent. Automatic hyper-parameter optimization (autoHPO) is favored due to its effective… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

  38. arXiv:1911.09817  [pdf, other

    cs.CV

    Graph Pruning for Model Compression

    Authors: Mingyang Zhang, Xinyi Yu, Jingtao Rong, Linlin Ou

    Abstract: Previous AutoML pruning works utilized individual layer features to automatically prune filters. We analyze the correlation for two layers from the different blocks which have a short-cut structure. It shows that, in one block, the deeper layer has many redundant filters which can be represented by filters in the former layer. So, it is necessary to take information from other layers into consider… ▽ More

    Submitted 22 September, 2021; v1 submitted 21 November, 2019; originally announced November 2019.

    Comments: accepted by Applied Intelligence