ML Insights
About

Posts

  • Jan 14, 2024

    Direct Preference-based Policy Optimization without Reward Modeling

  • Jan 7, 2024

    Sample efficient Reinforcement Learning with Human Feedback via Active Exploration

  • Jan 7, 2024

    Data-Efficient Alignment of Large Language Models with Human Feedback Through Natural Language

subscribe via RSS

ML Insights

  • ML Insights
  • Tech_Insight_

Blog related to latest research in machine learning, generative AI, large language models, reinforcement learning and advanced techniques in deep learning and machine learning.