RLHF and Alignment Experts
Explore insights and expertise from leading minds in Reinforcement Learning from Human Feedback and AI Alignment.
People
Daniel Vila
Curator. Everything datasets and human feedback for AI at Hugging Face. Prev: co-founder and CEO of Argilla (acquired by Hugging Face)
A LLN - large language Nathan - (RL, RLHF, society, robotics), athlete, yogi, chef. Writes http://interconnects.ai; at Ai2 via HuggingFace, Berkeley, and normal places
ML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (https://amzn.to/4fqvn0D). Blogging about AI research at magazine.sebastianraschka.com.
Anti-cynic. Towards a weirder future. Reinforcement Learning, Autonomous Vehicles, transportation systems, the works. Asst. Prof at NYU https://emerge-lab.github.io https://www.admonymous.co/eugenevinitsky
Research Scientist at Google DeepMind, interested in multiagent reinforcement learning, game theory, games, and search/planning. Lover of Linux 🐧, coffee ☕, and retro gaming. Big fan of open-source. #gohabsgo 🇨🇦 For more info: https://linktr.ee/sharky6000
Assistant Professor in CS + AI at USC. Previously at Stanford, CMU. Machine Learning, Decision Making, AI-for-Science, Generative AI, ML Systems, LLMs. https://willieneis.github.io
Postdoc in NLP/AI at the Allen Institute for AI & the University of Washington. ✨ I am on the academic job market for faculty positions! Interested in LLM post-training, RL for NLP, semantics & pragmatics. https://valentinapy.github.io
Assistant Professor & Faculty Fellow, NYU. AI Fellow, Georgetown University. Incoming Faculty Researcher, Google DeepMind. Probabilistic methods for robust and transparent ML & AI Policy. Prev: Oxford, Yale, UC Berkeley. https://timrudner.com
I make sure that OpenAI et al. aren't the only people who are able to study large scale AI systems.
PhD student at @cmurobotics.bsky.social working on interactive learning from implicit human feedback (e.g. imitation/RLHF). no model is an island. https://gokul.dev/.
Building generally intelligent agents - AI, ML, RL, Robotics @Unity3d
Computer Science PhD Student @ Stanford | Geopolitics & Technology Fellow @ Harvard Kennedy School/Belfer | Vice Chair EU AI Code of Practice | Views are my own
Similar packs
AI and Robotics Leaders
Connect with prominent researchers and thought leaders in AI and robotics.
AI Chatterboxes
A lively group of AI experts and researchers sharing insights and innovations.
AI Nerds
A friendly group of people who love everything about artificial intelligence.
AI Safety People
A group of experts focused on ensuring the safe and secure use of AI technology.
AI Safety Fans
A community of people focused on making AI safe and secure.
Anthropic Product Developers
A collaborative community focused on the development and application of Anthropic's AI technologies.