Lvfang Tao 陶履方

Researcher @ Tencent
Reinforcement Learning · LLMs & Agents · AutoML · Neural Compression

harbour_city.jpg

Lvfang Tao has been a researcher at Tencent since 2022, after receiving his Master of Science degree from the School of Electronic and Computer Engineering (SECE) at Peking University. His most recent work focuses on mid-training and post-training for large language models, including synthetic data, on-policy distillation, RLVR, and agentic reinforcement learning. Since 2019, he has led projects on from-scratch and continual pretraining of both small and large language models across text, speech, and image modalities. He is also a core contributor to the reinforcement-learning platform AI Arena, which serves the education and research community as well as game developers.

陶履方,2022 年毕业于北京大学信息工程学院并获得理学硕士学位,同年加入腾讯担任算法研究员。作为 腾讯开悟平台 算法方向的核心贡献者,研究方向包括强化学习环境、智能体与平台系统。近期工作主要围绕大模型的中训练与后训练阶段,主导项目涵盖合成数据、on-policy 蒸馏、RLVR、Tool-Use / Agentic RL 等技术方向,相关成果已在公开竞赛与内部产品中得到验证。自 2019 年起,在文本、语音和图像等模态上积累了大规模预训练与增量预训练的经验。

news

Dec 15, 2024 Presenting Tinytron’s double-1st-place solutions (model compression & pretrain tracks) as Project Lead in NeurIPS 2024 competition workshop “Edge-Device Large Language Model Competition”. Workshop link.
Jul 21, 2023 “AdaNIC: Towards Practical Neural Image Compression via Dynamic Transform Routing” accepted to ICCV 2023 (first author). Paper link.
Dec 08, 2022 Presenting Team TEG-AutoML’s 2nd-place solution as Project Lead in NeurIPS 2022 competition workshop “AutoML Decathlon: Diverse Tasks, Modern Methods, and Efficiency at Scale”. Workshop link.

selected publications

  1. Tinytron Technical Report
    Lvfang Tao, Renjie Mao, Yongguang Lin, and 3 more authors
    Dec 2024
  2. Automl decathlon: Diverse tasks, modern methods, and efficiency at scale
    Nicholas Roberts, Samuel Guo, Cong Xu, and 8 more authors
    In NeurIPS 2022 Competition Track, Dec 2023
  3. Adanic: Towards practical neural image compression via dynamic transform routing
    Lvfang Tao, Wei Gao, Ge Li, and 1 more author
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, Dec 2023
  4. A hardware implementation of entropy encoder for 8K video coding
    Lvfang Tao and Wei Gao
    In 2022 IEEE International Conference on Multimedia and Expo (ICME), Dec 2022
  5. Efficient channel pruning based on architecture alignment and probability model bypassing
    Lvfang Tao and Wei Gao
    In 2021 IEEE international conference on systems, man, and cybernetics (SMC), Dec 2021
  6. A fast view synthesis implementation method for light field applications
    Wei Gao, Linjie Zhou, and Lvfang Tao
    ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Dec 2021