paper reading
把看过的论文做一个总结,防止我忘掉 # 阅读习惯 1. 看到公式变量用绿色标注一下!不用这样…… 2. 几遍 1. 全文通读,分清楚每句话的意义,只记录重要的话,写下问题 2. 每章提炼出重点 3. 回答以上问题
paper集合
- FULL PARAMETER FINE-TUNING FOR LARGE LANGUAGE MODELS WITH LIMITED
RESOURCES
- 全参数|微调
- Towards A Unified View Of Parameter-Efficient Transfer Learning
- 微调|高效参数
- TinyLLM: Learning a Small Student from Multiple Large Language Models
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
- FlexGen: high-throughput generative inference of large language models
- LISA: Layerwise Importance Sampling for Memory-Efficient Large
Language Model
- Fine-Tuning
- TRANSFORMER EXPLAINER: Interactive Learning of Text-Generative
Models
- 没啥营养,把transformer的过程搞了个可视化(也没啥用),唯一的贡献就是让我重新梳理了一遍transformer
- CLLMs: Consistency Large Language Models
- AN EMULATOR FOR FINE-TUNING LARGE LANGUAGE MODELS USING SMALL LANGUAGE MODELS
- Kubric: A scalable dataset generator
- GR-2: A Generative Video-Language-Action Model with Web-Scale Knowledge for Robot Manipulation
- 毫无印象的论文
- MANISKILL3: GPU PARALLELIZED ROBOTICS SIMULATION AND RENDERING FOR GENERALIZABLE EMBODIED AI
- Aligner: Efficient Alignment by Learning to Correct
- A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More
- 综述文章,对齐
- Multi-Goal Reinforcement Learning: Challenging Robotics Environments
and Request for Research
- 对robotic在强化学习上的总结
- Generative Artificial Intelligence in Robotic Manipulation: A Survey
- ai in 机器人操作的综述,全讲的方法
- VideoWorld: Exploring Knowledge Learning from Unlabeled Videos
- 从视频中获得知识,没看仔细
- CALVIN: A Benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
- 这个数据收集方式有点野了
- DIFFTACTILE: A PHYSICS-BASED DIFFERENTIABLE TACTILE SIMULATOR FOR
CONTACT-RICH ROBOTIC MANIPULATION
- 借助tactile实现精细操作,但是只有2指
- 但是还有一点问题,以后脑子清晰点再看看