RLHF and Alignment Experts
Explore insights and expertise from leading minds in Reinforcement Learning from Human Feedback and AI Alignment.
People
Daniel Vila
CuratorEverything datasets and human feedback for AI at Hugging Face. Prev: co-founder and CEO of Argilla (acquired by Hugging Face)
A LLN - large language Nathan - (RL, RLHF, society, robotics), athlete, yogi, chef Writes http://interconnects.ai At Ai2 via HuggingFace, Berkeley, and normal places
Anti-cynic. Towards a weirder future. Reinforcement Learning, Autonomous Vehicles, transportation systems, the works. Asst. Prof at NYU https://emerge-lab.github.io https://www.admonymous.co/eugenevinitsky
Research Scientist at Google DeepMind, interested in multiagent reinforcement learning, game theory, games, and search/planning. Lover of Linux 🐧, coffee ☕, and retro gaming. Big fan of open-source. #gohabsgo 🇨🇦 For more info: https://linktr.ee/sharky6000
ML researcher and former statistics professor turned research engineer. Author of “Build a Large Language Model From Scratch” (https://amzn.to/4fqvn0D). AI research blogger at https://magazine.sebastianraschka.com.
Assistant Professor in CS + AI at USC. Previously at Stanford, CMU. Machine Learning, Decision Making, AI-for-Science, Generative AI, ML Systems, LLMs. https://willieneis.github.io
Postdoc in NLP/AI at the Allen Institute for AI & the University of Washington. ✨I am on the academic job market for faculty positions! Interested in LLM post-training, RL for NLP, semantics & pragmatics. 🌐 https://valentinapy.github.io
Assistant Professor & Faculty Fellow, NYU. AI Fellow, Georgetown University. Incoming Faculty Researcher, Google DeepMind. Probabilistic methods for robust and transparent ML & AI Policy. Prev: Oxford, Yale, UC Berkeley. https://timrudner.com
I make sure that OpenAI et al. aren't the only people who are able to study large scale AI systems.
PhD student at @cmurobotics.bsky.social working on interactive learning from implicit human feedback (e.g. imitation/RLHF). no model is an island. https://gokul.dev/.
Building generally intelligent agents - AI, ML, RL, Robotics @Unity3d
Postdoc at Princeton PLI. Formerly PhD at Stanford CS. Working on behavioral machine learning. https://kawine.github.io/
Similar packs
AI and Robotics Leaders
AI and Robotics Leaders
Connect with prominent researchers and thought leaders in AI and robotics.
AI Chatterboxes
AI Chatterboxes
A lively group of AI experts and researchers sharing insights and innovations.
AI Nerds
AI Nerds
A friendly group of people who love everything about artificial intelligence.
AI Safety People
AI Safety People
A group of experts focused on ensuring the safe and secure use of AI technology.
AI Safety Fans
AI Safety Fans
A community of people focused on making AI safe and secure.
Anthropic Product Developers
Anthropic Product Developers
A collaborative community focused on the development and application of Anthropic's AI technologies.