Zum Hauptinhalt springen

Showing 1–26 of 26 results for author: Shavit, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.10994  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Panza: A Personalized Text Writing Assistant via Data Playback and Local Fine-Tuning

    Authors: Armand Nicolicioiu, Eugenia Iofinova, Eldar Kurtic, Mahdi Nikdan, Andrei Panferov, Ilia Markov, Nir Shavit, Dan Alistarh

    Abstract: The availability of powerful open-source large language models (LLMs) opens exciting use-cases, such as automated personal assistants that adapt to the user's unique data and demands. Two key desiderata for such assistants are personalization-in the sense that the assistant should reflect the user's own style-and privacy-in the sense that users may prefer to always store their personal data locall… ▽ More

    Submitted 24 June, 2024; originally announced July 2024.

    Comments: Panza is available at https://github.com/IST-DASLab/PanzaMail

  2. arXiv:2405.15756  [pdf, other

    cs.LG cs.AI

    Sparse Expansion and Neuronal Disentanglement

    Authors: Shashata Sawmya, Linghao Kong, Ilia Markov, Dan Alistarh, Nir Shavit

    Abstract: We show how to improve the inference efficiency of an LLM by expanding it into a mixture of sparse experts, where each expert is a copy of the original weights, one-shot pruned for a specific cluster of input values. We call this approach $\textit{Sparse Expansion}$. We show that, for models such as Llama 2 70B, as we increase the number of sparse experts, Sparse Expansion outperforms all other on… ▽ More

    Submitted 24 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 10 pages, 8 figures

  3. arXiv:2312.08793  [pdf, other

    cs.LG cs.AI cs.CL cs.CR

    Forbidden Facts: An Investigation of Competing Objectives in Llama-2

    Authors: Tony T. Wang, Miles Wang, Kaivalya Hariharan, Nir Shavit

    Abstract: LLMs often face competing pressures (for example helpfulness vs. harmlessness). To understand how models resolve such conflicts, we study Llama-2-chat models on the forbidden fact task. Specifically, we instruct Llama-2 to truthfully complete a factual recall statement while forbidding it from saying the correct answer. This often makes the model give incorrect answers. We decompose Llama-2 into 1… ▽ More

    Submitted 31 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted to the ATTRIB and SoLaR workshops at NeurIPS 2023; (v3: clarified experimental details)

  4. arXiv:2303.00882  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM

    X-Ray2EM: Uncertainty-Aware Cross-Modality Image Reconstruction from X-Ray to Electron Microscopy in Connectomics

    Authors: Yicong Li, Yaron Meirovitch, Aaron T. Kuan, Jasper S. Phelps, Alexandra Pacureanu, Wei-Chung Allen Lee, Nir Shavit, Lu Mi

    Abstract: Comprehensive, synapse-resolution imaging of the brain will be crucial for understanding neuronal computations and function. In connectomics, this has been the sole purview of volume electron microscopy (EM), which entails an excruciatingly difficult process because it requires cutting tissue into many thin, fragile slices that then need to be imaged, aligned, and reconstructed. Unlike EM, hard X-… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted by ISBI 2023 conference. Supplementary material is available in this arXiv version

  5. arXiv:2302.07348  [pdf, other

    cs.LG cs.AI stat.ML

    Cliff-Learning

    Authors: Tony T. Wang, Igor Zablotchi, Nir Shavit, Jonathan S. Rosenfeld

    Abstract: We study the data-scaling of transfer learning from foundation models in the low-downstream-data regime. We observe an intriguing phenomenon which we call cliff-learning. Cliff-learning refers to regions of data-scaling laws where performance improves at a faster than power law rate (i.e. regions of concavity on a log-log scaling plot). We conduct an in-depth investigation of foundation-model clif… ▽ More

    Submitted 6 June, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 16 pages; v2 updates: improved layout, added acknowledgements

  6. arXiv:2302.03819  [pdf, other

    cs.CV cs.LG q-bio.NC

    The XPRESS Challenge: Xray Projectomic Reconstruction -- Extracting Segmentation with Skeletons

    Authors: Tri Nguyen, Mukul Narwani, Mark Larson, Yicong Li, Shuhan Xie, Hanspeter Pfister, Donglai Wei, Nir Shavit, Lu Mi, Alexandra Pacureanu, Wei-Chung Lee, Aaron T. Kuan

    Abstract: The wiring and connectivity of neurons form a structural basis for the function of the nervous system. Advances in volume electron microscopy (EM) and image segmentation have enabled mapping of circuit diagrams (connectomics) within local regions of the mouse brain. However, applying volume EM over the whole brain is not currently feasible due to technological challenges. As a result, comprehensiv… ▽ More

    Submitted 24 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 6 pages, 2 figures

  7. arXiv:2110.06421  [pdf, other

    cs.LG

    Revisiting Latent-Space Interpolation via a Quantitative Evaluation Framework

    Authors: Lu Mi, Tianxing He, Core Francisco Park, Hao Wang, Yue Wang, Nir Shavit

    Abstract: Latent-space interpolation is commonly used to demonstrate the generalization ability of deep latent variable models. Various algorithms have been proposed to calculate the best trajectory between two encodings in the latent space. In this work, we show how data labeled with semantically continuous attributes can be utilized to conduct a quantitative evaluation of latent-space interpolation algori… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: 11 pages

  8. arXiv:2106.14880  [pdf, other

    cs.CV

    HDMapGen: A Hierarchical Graph Generative Model of High Definition Maps

    Authors: Lu Mi, Hang Zhao, Charlie Nash, Xiaohan Jin, Jiyang Gao, Chen Sun, Cordelia Schmid, Nir Shavit, Yuning Chai, Dragomir Anguelov

    Abstract: High Definition (HD) maps are maps with precise definitions of road lanes with rich semantics of the traffic rules. They are critical for several key stages in an autonomous driving system, including motion forecasting and planning. However, there are only a small amount of real-world road topologies and geometries, which significantly limits our ability to test out the self-driving stack to gener… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

  9. Learning Guided Electron Microscopy with Active Acquisition

    Authors: Lu Mi, Hao Wang, Yaron Meirovitch, Richard Schalek, Srinivas C. Turaga, Jeff W. Lichtman, Aravinthan D. T. Samuel, Nir Shavit

    Abstract: Single-beam scanning electron microscopes (SEM) are widely used to acquire massive data sets for biomedical study, material analysis, and fabrication inspection. Datasets are typically acquired with uniform acquisition: applying the electron beam with the same power and duration to all image pixels, even if there is great variety in the pixels' importance for eventual use. Many SEMs are now able t… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: MICCAI 2020

  10. arXiv:2006.10621  [pdf, other

    cs.LG cs.CV stat.ML

    On the Predictability of Pruning Across Scales

    Authors: Jonathan S. Rosenfeld, Jonathan Frankle, Michael Carbin, Nir Shavit

    Abstract: We show that the error of iteratively magnitude-pruned networks empirically follows a scaling law with interpretable coefficients that depend on the architecture and task. We functionally approximate the error of the pruned networks, showing it is predictable in terms of an invariant tying width, depth, and pruning level, such that networks of vastly different pruned densities are interchangeable.… ▽ More

    Submitted 3 July, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

  11. arXiv:1912.02165  [pdf, other

    cs.DC cs.LG

    L3 Fusion: Fast Transformed Convolutions on CPUs

    Authors: Rati Gelashvili, Nir Shavit, Aleksandar Zlateski

    Abstract: Fast convolutions via transforms, either Winograd or FFT, had emerged as a preferred way of performing the computation of convolutional layers, as it greatly reduces the number of required operations. Recent work shows that, for many layer structures, a well--designed implementation of fast convolutions can greatly utilize modern CPUs, significantly reducing the compute time. However, the generous… ▽ More

    Submitted 4 December, 2019; originally announced December 2019.

  12. arXiv:1910.04858  [pdf, other

    cs.CV cs.LG

    Training-Free Uncertainty Estimation for Dense Regression: Sensitivity as a Surrogate

    Authors: Lu Mi, Hao Wang, Yonglong Tian, Hao He, Nir Shavit

    Abstract: Uncertainty estimation is an essential step in the evaluation of the robustness for deep learning models in computer vision, especially when applied in risk-sensitive areas. However, most state-of-the-art deep learning models either fail to obtain uncertainty estimation or need significant modification (e.g., formulating a proper Bayesian treatment) to obtain it. Most previous methods are not able… ▽ More

    Submitted 10 January, 2022; v1 submitted 27 September, 2019; originally announced October 2019.

    Comments: In proceedings of the 36th AAAI Conference on Artificial Intelligence

  13. arXiv:1909.12673  [pdf, other

    cs.LG cs.CL cs.CV stat.ML

    A Constructive Prediction of the Generalization Error Across Scales

    Authors: Jonathan S. Rosenfeld, Amir Rosenfeld, Yonatan Belinkov, Nir Shavit

    Abstract: The dependency of the generalization error of neural networks on model and dataset size is of critical importance both in practice and for understanding the theory of neural networks. Nevertheless, the functional form of this dependency remains elusive. In this work, we present a functional form which approximates well the generalization error in practice. Capitalizing on the successful concept of… ▽ More

    Submitted 20 December, 2019; v1 submitted 27 September, 2019; originally announced September 2019.

    Comments: ICLR 2020

  14. arXiv:1812.01157  [pdf, other

    cs.CV

    Cross-Classification Clustering: An Efficient Multi-Object Tracking Technique for 3-D Instance Segmentation in Connectomics

    Authors: Yaron Meirovitch, Lu Mi, Hayk Saribekyan, Alexander Matveev, David Rolnick, Nir Shavit

    Abstract: Pixel-accurate tracking of objects is a key element in many computer vision applications, often solved by iterated individual object tracking or instance segmentation followed by object matching. Here we introduce cross-classification clustering (3C), a technique that simultaneously tracks complex, interrelated objects in an image stack. The key idea in cross-classification is to efficiently turn… ▽ More

    Submitted 15 June, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

    Comments: 11 figures

    Journal ref: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 8425-8435

  15. arXiv:1705.10882  [pdf, other

    cs.CV cs.AI q-bio.NC stat.ML

    Morphological Error Detection in 3D Segmentations

    Authors: David Rolnick, Yaron Meirovitch, Toufiq Parag, Hanspeter Pfister, Viren Jain, Jeff W. Lichtman, Edward S. Boyden, Nir Shavit

    Abstract: Deep learning algorithms for connectomics rely upon localized classification, rather than overall morphology. This leads to a high incidence of erroneously merged objects. Humans, by contrast, can easily detect such errors by acquiring intuition for the correct morphology of objects. Biological neurons have complicated and variable shapes, which are challenging to learn, and merge errors take a mu… ▽ More

    Submitted 30 May, 2017; originally announced May 2017.

    Comments: 13 pages, 6 figures

  16. arXiv:1705.10694  [pdf, other

    cs.LG cs.AI cs.CV cs.NE

    Deep Learning is Robust to Massive Label Noise

    Authors: David Rolnick, Andreas Veit, Serge Belongie, Nir Shavit

    Abstract: Deep neural networks trained on large supervised datasets have led to impressive results in image classification and other tasks. However, well-annotated datasets can be time-consuming and expensive to collect, lending increased interest to larger but noisy datasets that are more easily obtained. In this paper, we show that deep neural networks are capable of generalizing from training data for wh… ▽ More

    Submitted 26 February, 2018; v1 submitted 30 May, 2017; originally announced May 2017.

  17. arXiv:1703.01467  [pdf, other

    cs.CV

    Generative Compression

    Authors: Shibani Santurkar, David Budden, Nir Shavit

    Abstract: Traditional image and video compression algorithms rely on hand-crafted encoder/decoder pairs (codecs) that lack adaptability and are agnostic to the data being compressed. Here we describe the concept of generative compression, the compression of data using generative models, and suggest that it is a direction worth pursuing to produce more accurate and visually pleasing reconstructions at much d… ▽ More

    Submitted 4 June, 2017; v1 submitted 4 March, 2017; originally announced March 2017.

  18. arXiv:1702.07386  [pdf, other

    cs.CV

    Toward Streaming Synapse Detection with Compositional ConvNets

    Authors: Shibani Santurkar, David Budden, Alexander Matveev, Heather Berlin, Hayk Saribekyan, Yaron Meirovitch, Nir Shavit

    Abstract: Connectomics is an emerging field in neuroscience that aims to reconstruct the 3-dimensional morphology of neurons from electron microscopy (EM) images. Recent studies have successfully demonstrated the use of convolutional neural networks (ConvNets) for segmenting cell membranes to individuate neurons. However, there has been comparatively little success in high-throughput identification of the i… ▽ More

    Submitted 23 February, 2017; originally announced February 2017.

    Comments: 10 pages, 9 figures

  19. arXiv:1612.02120  [pdf, other

    q-bio.QM cs.AI q-bio.NC

    A Multi-Pass Approach to Large-Scale Connectomics

    Authors: Yaron Meirovitch, Alexander Matveev, Hayk Saribekyan, David Budden, David Rolnick, Gergely Odor, Seymour Knowles-Barley, Thouis Raymond Jones, Hanspeter Pfister, Jeff William Lichtman, Nir Shavit

    Abstract: The field of connectomics faces unprecedented "big data" challenges. To reconstruct neuronal connectivity, automated pixel-level segmentation is required for petabytes of streaming electron microscopy data. Existing algorithms provide relatively good accuracy but are unacceptably slow, and would require years to extract connectivity graphs from even a single cubic millimeter of neural tissue. Here… ▽ More

    Submitted 7 December, 2016; originally announced December 2016.

    Comments: 18 pages, 10 figures

  20. arXiv:1611.06565  [pdf, other

    cs.CV cs.DC cs.NE

    Deep Tensor Convolution on Multicores

    Authors: David Budden, Alexander Matveev, Shibani Santurkar, Shraman Ray Chaudhuri, Nir Shavit

    Abstract: Deep convolutional neural networks (ConvNets) of 3-dimensional kernels allow joint modeling of spatiotemporal features. These networks have improved performance of video and volumetric image analysis, but have been limited in size due to the low memory ceiling of GPU hardware. Existing CPU implementations overcome this constraint but are impractically slow. Here we extend and optimize the faster W… ▽ More

    Submitted 11 June, 2017; v1 submitted 20 November, 2016; originally announced November 2016.

    Comments: 11 pages, 4 figures, 1 supplementary doc

  21. arXiv:1607.06139  [pdf, ps, other

    cs.DC

    A Complexity-Based Hierarchy for Multiprocessor Synchronization

    Authors: Faith Ellen, Rati Gelashvili, Nir Shavit, Leqi Zhu

    Abstract: For many years, Herlihy's elegant computability based Consensus Hierarchy has been our best explanation of the relative power of various types of multiprocessor synchronization objects when used in deterministic algorithms. However, key to this hierarchy is treating synchronization instructions as distinct objects, an approach that is far from the real-world, where multiprocessor programs apply sy… ▽ More

    Submitted 3 May, 2018; v1 submitted 20 July, 2016; originally announced July 2016.

  22. arXiv:1411.5383  [pdf, other

    q-bio.NC cs.DS math.PR math.ST

    Johnson-Lindenstrauss Compression with Neuroscience-Based Constraints

    Authors: Zeyuan Allen-Zhu, Rati Gelashvili, Silvio Micali, Nir Shavit

    Abstract: Johnson-Lindenstrauss (JL) matrices implemented by sparse random synaptic connections are thought to be a prime candidate for how convergent pathways in the brain compress information. However, to date, there is no complete mathematical support for such implementations given the constraints of real neural tissue. The fact that neurons are either excitatory or inhibitory implies that every so imple… ▽ More

    Submitted 19 November, 2014; originally announced November 2014.

    Comments: A shorter version of this paper has appeared in the Proceedings of the National Academy of Sciences

  23. arXiv:1411.0168  [pdf, other

    cs.DC cs.DS

    On the Importance of Registers for Computability

    Authors: Rati Gelashvili, Mohsen Ghaffari, Jerry Li, Nir Shavit

    Abstract: All consensus hierarchies in the literature assume that we have, in addition to copies of a given object, an unbounded number of registers. But why do we really need these registers? This paper considers what would happen if one attempts to solve consensus using various objects but without any registers. We show that under a reasonable assumption, objects like queues and stacks cannot emulate th… ▽ More

    Submitted 1 November, 2014; originally announced November 2014.

    Comments: 12 pages, 0 figures

  24. arXiv:1405.5689  [pdf, other

    cs.DC

    Inherent Limitations of Hybrid Transactional Memory

    Authors: Dan Alistarh, Justin Kopinsky, Petr Kuznetsov, Srivatsan Ravi, Nir Shavit

    Abstract: Several Hybrid Transactional Memory (HyTM) schemes have recently been proposed to complement the fast, but best-effort, nature of Hardware Transactional Memory (HTM) with a slow, reliable software backup. However, the fundamental limitations of building a HyTM with nontrivial concurrency between hardware and software transactions are still not well understood. In this paper, we propose a general… ▽ More

    Submitted 17 February, 2015; v1 submitted 22 May, 2014; originally announced May 2014.

  25. arXiv:1405.5461  [pdf, other

    cs.DC cs.DS

    The LevelArray: A Fast, Practical Long-Lived Renaming Algorithm

    Authors: Dan Alistarh, Justin Kopinsky, Alexander Matveev, Nir Shavit

    Abstract: The long-lived renaming problem appears in shared-memory systems where a set of threads need to register and deregister frequently from the computation, while concurrent operations scan the set of currently registered threads. Instances of this problem show up in concurrent implementations of transactional memory, flat combining, thread barriers, and memory reclamation schemes for lock-free data s… ▽ More

    Submitted 21 May, 2014; originally announced May 2014.

    Comments: ICDCS 2014

  26. arXiv:1311.3200  [pdf, ps, other

    cs.DC

    Are Lock-Free Concurrent Algorithms Practically Wait-Free?

    Authors: Dan Alistarh, Keren Censor-Hillel, Nir Shavit

    Abstract: Lock-free concurrent algorithms guarantee that some concurrent operation will always make progress in a finite number of steps. Yet programmers prefer to treat concurrent code as if it were wait-free, guaranteeing that all operations always make progress. Unfortunately, designing wait-free algorithms is generally a very complex task, and the resulting algorithms are not always efficient. While obt… ▽ More

    Submitted 15 November, 2013; v1 submitted 13 November, 2013; originally announced November 2013.

    Comments: 25 pages