Showing 1–11 of 11 results for author: Asur, S

Searching in archive cs.
  1. arXiv:2408.15339  [pdf, other]

    cs.LG cs.CL

    UNA: Unifying Alignments of RLHF/PPO, DPO and KTO by a Generalized Implicit Reward Function

    Authors: Zhichao Wang, Bin Bi, Can Huang, Shiva Kumar Pentyala, Zixu James Zhu, Sitaram Asur, Na Claire Cheng

    Abstract: An LLM is pretrained on trillions of tokens, but the pretrained LLM may still generate undesired responses. To solve this problem, alignment techniques such as RLHF, DPO and KTO are proposed. However, these alignment techniques have limitations. For example, RLHF requires training the reward model and policy separately, which is complex, time-consuming, memory intensive and unstable during trainin…

    Submitted 27 August, 2024; originally announced August 2024.

  2. arXiv:2407.16216  [pdf, other]

    cs.CL

    A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More

    Authors: Zhichao Wang, Bin Bi, Shiva Kumar Pentyala, Kiran Ramnath, Sougata Chaudhuri, Shubham Mehrotra, Zixu Zhu, Xiang-Bo Mao, Sitaram Asur, Na Cheng

    Abstract: With advancements in self-supervised learning, the availability of trillions tokens in a pre-training corpus, instruction fine-tuning, and the development of large Transformers with billions of parameters, large language models (LLMs) are now capable of generating factual and coherent responses to human queries. However, the mixed quality of training data can lead to the generation of undesired re…

    Submitted 23 July, 2024; originally announced July 2024.

  3. arXiv:2406.17923  [pdf, other]

    cs.CL

    PAFT: A Parallel Training Paradigm for Effective LLM Fine-Tuning

    Authors: Shiva Kumar Pentyala, Zhichao Wang, Bin Bi, Kiran Ramnath, Xiang-Bo Mao, Regunathan Radhakrishnan, Sitaram Asur, Na Cheng

    Abstract: Large language models (LLMs) have shown remarkable abilities in diverse natural language processing (NLP) tasks. The LLMs generally undergo supervised fine-tuning (SFT) followed by preference alignment to be usable in downstream applications. However, this sequential training pipeline leads to alignment tax that degrades the LLM performance. This paper introduces PAFT, a new PArallel training pa…

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:1411.3214  [pdf, ps, other]

    cs.CY cs.HC cs.SI physics.soc-ph

    Deciding what to display: maximizing the information value of social media

    Authors: Sandra Servia-Rodríguez, Bernardo A. Huberman, Sitaram Asur

    Abstract: In information-rich environments, the competition for users' attention leads to a flood of content from which people often find hard to sort out the most relevant and useful pieces. Using Twitter as a case study, we applied an attention economy solution to generate the most informative tweets for its users. By considering the novelty and popularity of tweets as objective measures of their relevanc…

    Submitted 12 November, 2014; originally announced November 2014.

  5. arXiv:1312.0649  [pdf, other]

    cs.SI cs.CY physics.soc-ph

    Dynamics of Trends and Attention in Chinese Social Media

    Authors: Louis Lei Yu, Sitaram Asur, Bernardo A. Huberman

    Abstract: There has been a tremendous rise in the growth of online social networks all over the world in recent years. It has facilitated users to generate a large amount of real-time content at an incessant rate, all competing with each other to attract enough attention and become popular trends. While Western online social networks such as Twitter have been well studied, the popular Chinese microblogging…

    Submitted 2 December, 2013; originally announced December 2013.

    Comments: arXiv admin note: substantial text overlap with arXiv:1202.0327

  6. arXiv:1202.0332  [pdf, other]

    cs.CY cs.NI cs.SI physics.soc-ph

    The Pulse of News in Social Media: Forecasting Popularity

    Authors: Roja Bandari, Sitaram Asur, Bernardo A. Huberman

    Abstract: News articles are extremely time sensitive by nature. There is also intense competition among news items to propagate as widely as possible. Hence, the task of predicting the popularity of news items on the social web is both interesting and challenging. Prior research has dealt with predicting eventual online popularity based on early popularity. It is most desirable, however, to predict the popu…

    Submitted 1 February, 2012; originally announced February 2012.

  7. arXiv:1202.0327  [pdf, other]

    cs.CY cs.SI physics.soc-ph

    Artificial Inflation: The True Story of Trends in Sina Weibo

    Authors: Louis Yu, Sitaram Asur, Bernardo A. Huberman

    Abstract: There has been a tremendous rise in the growth of online social networks all over the world in recent years. This has facilitated users to generate a large amount of real-time content at an incessant rate, all competing with each other to attract enough attention and become trends. While Western online social networks such as Twitter have been well studied, characteristics of the popular Chinese m…

    Submitted 1 February, 2012; originally announced February 2012.

    Comments: arXiv admin note: text overlap with arXiv:1107.3522

  8. arXiv:1107.3522  [pdf, other]

    cs.CY cs.SI physics.soc-ph

    What Trends in Chinese Social Media

    Authors: Louis Yu, Sitaram Asur, Bernardo A. Huberman

    Abstract: There has been a tremendous rise in the growth of online social networks all over the world in recent times. While some networks like Twitter and Facebook have been well documented, the popular Chinese microblogging social network Sina Weibo has not been studied. In this work, we examine the key topics that trend on Sina Weibo and contrast them with our observations on Twitter. We find that there…

    Submitted 18 July, 2011; originally announced July 2011.

  9. arXiv:1102.1402  [pdf, other]

    cs.CY physics.soc-ph

    Trends in Social Media: Persistence and Decay

    Authors: Sitaram Asur, Bernardo A. Huberman, Gabor Szabo, Chunyan Wang

    Abstract: Social media generates a prodigious wealth of real-time content at an incessant rate. From all the content that people create and share, only a few topics manage to attract enough attention to rise to the top and become temporal trends which are displayed to users. The question of what factors cause the formation and persistence of trends is an important one that has not been answered yet. In this…

    Submitted 7 February, 2011; originally announced February 2011.

  10. arXiv:1008.1253  [pdf, other]

    cs.CY physics.soc-ph

    Influence and Passivity in Social Media

    Authors: Daniel M. Romero, Wojciech Galuba, Sitaram Asur, Bernardo A. Huberman

    Abstract: The ever-increasing amount of information flowing through Social Media forces the members of these networks to compete for attention and influence by relying on other people to spread their message. A large study of information propagation within Twitter reveals that the majority of users act as passive information consumers and do not forward the content to the network. Therefore, in order for in…

    Submitted 6 August, 2010; originally announced August 2010.

  11. arXiv:1003.5699  [pdf, ps, other]

    cs.CY physics.soc-ph

    Predicting the Future with Social Media

    Authors: Sitaram Asur, Bernardo A. Huberman

    Abstract: In recent years, social media has become ubiquitous and important for social networking and content sharing. And yet, the content that is generated from these websites remains largely untapped. In this paper, we demonstrate how social media content can be used to predict real-world outcomes. In particular, we use the chatter from Twitter.com to forecast box-office revenues for movies. We show tha…

    Submitted 29 March, 2010; originally announced March 2010.