Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Dharamsi, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01614  [pdf, other

    cs.LG cs.AI

    Enhancing Stability for Large Models Training in Constrained Bandwidth Networks

    Authors: Yun Dai, Tejas Dharamsi, Byron Hsu, Tao Song, Hamed Firooz

    Abstract: Training extremely large language models with billions of parameters is a computationally intensive task that pushes the limits of current data parallel training systems. While techniques like ZeRO++ have enabled efficient distributed training of such giant models on inexpensive low-bandwidth clusters, they can suffer from convergence issues due to potential race conditions in the hierarchical par… ▽ More

    Submitted 31 July, 2024; v1 submitted 27 June, 2024; originally announced July 2024.

  2. arXiv:1711.06195  [pdf, other

    stat.ML cs.LG

    Neurology-as-a-Service for the Developing World

    Authors: Tejas Dharamsi, Payel Das, Tejaswini Pedapati, Gregory Bramble, Vinod Muthusamy, Horst Samulowitz, Kush R. Varshney, Yuvaraj Rajamanickam, John Thomas, Justin Dauwels

    Abstract: Electroencephalography (EEG) is an extensively-used and well-studied technique in the field of medical diagnostics and treatment for brain disorders, including epilepsy, migraines, and tumors. The analysis and interpretation of EEGs require physicians to have specialized training, which is not common even among most doctors in the developed world, let alone the developing world where physician sho… ▽ More

    Submitted 21 November, 2017; v1 submitted 16 November, 2017; originally announced November 2017.

    Comments: Presented at NIPS 2017 Workshop on Machine Learning for the Developing World