  1. T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning

    Jun 2, 2025 · View a PDF of the paper titled T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning, by Yanjun Fu and 2 other authors

  2. GitHub - Dynamite321/T-SHIRT

    We demonstrate that models instruction-tuned on a curated dataset (only 5% of the original size) using T-SHIRT can outperform those trained on the entire large-scale dataset by up to 5.48 …

  3. In this paper, we present Token-Selective Hierarchical Data Selection for Instruction Tuning (T-SHIRT), a framework for selecting high-quality subsets from instruction tuning datasets.

  4. T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning

    We demonstrate that models instruction-tuned on a curated dataset (only 5% of the original size) using T-SHIRT can outperform those trained on the entire large-scale dataset by up to 5.48 …

  5. June 3, 2025 Multimodal Large Model Paper Digest - Zhihu

    Jun 4, 2025 · Summary: This paper proposes StochasTok, a tokenization scheme that splits tokens at random. diffusion LLM arxiv.org/pdf/2506.0041 Title: Accelerating Diffusion LLMs via Adaptive Parallel Decoding 关 …

  6. Yanjun Fu

    Dec 16, 2025 · His research interests include LLMs, Efficient Post-training, and Trustworthy ML. T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning. Yanjun …

  7. T-Shirt: Token-Selective Hierarchical Data Selection for Instruction Tuning

    Jun 3, 2025 · In this paper, we present Token-Selective Hierarchical Data Selection for Instruction Tuning (T-Shirt), a framework for selecting high-quality subsets from instruction tuning datasets.

  8. T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning

    A new data selection framework, T-SHIRT, improves instruction tuning for Large Language Models by focusing on token-level informativeness and robust sample quality, leading to better …

  9. T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning

    Jun 2, 2025 · We demonstrate that models instruction-tuned on a curated dataset (only 5% of the original size) using T-SHIRT can outperform those trained on the entire large-scale dataset by …

  10. Sanghamitra Dutta - Publications - Google Sites

    Fu, F. Hamman, S. Dutta, "T-Shirt: Token-Selective Hierarchical Data Selection for Instruction Tuning" Neural Information Processing Systems (NeurIPS 2025). [Full-Paper]