John Schulman Caltech

About 26 results

Open links in new tab

Any time

joschu.net
http://joschu.net
John Schulman's Homepage
John Schulman's Homepage I am currently a researcher at Anthropic, where I’m working on aligning large language models; some of my interests include scalable oversight and …
joschu.net
http://joschu.net › presentations.html
Presentations
John Schulman's Homepage. Presentations. Some recent talks: 2024 Talk about OpenAI Model Spec at Scale conference; 2023 ICML talk on proxy objectives; 2023 Berkeley talk on …
joschu.net
http://joschu.net › publications.html
Selected Publications
John Schulman, Jonathan Ho, Cameron Lee, and Pieter Abbeel International Symposium on Robotics Research (ISRR), 2013 Paper / Videos
joschu.net
http://joschu.net › docs › thesis.pdf
[PDF]
O P T I M I Z I N G E X P E C TAT I O N S : F R O M D E E P R E I …
john schulman Summer, 2016 A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Computer Science in the Graduate Division of the …
joschu.net
http://joschu.net › blog › opinionated-guide-ml-research.html
An Opinionated Guide to ML Research - joschu.net
Jan 24, 2020 · John Schulman's Homepage. An Opinionated Guide to ML Research. Posted on 2020/01/24. ← back to blog index. I originally wrote this guide in back in December 2017 for …
joschu.net
http://joschu.net › blog › kl-approx.html
Approximating KL Divergence
Mar 7, 2020 · Note that the bias of k2 is incredibly low here: it’s 0.2%. Now let’s try for a larger true KL divergence. p=N(1,1) gives us a true KL divergence of 0.5.
joschu.net
http://joschu.net › blog.html
Blog Index - joschu.net
Jan 24, 2020 · John Schulman's Homepage. Blog Index. Sending Samples Without Bits-Back (2020/03/08) Approximating KL Divergence (2020/03/07) An Opinionated Guide to ML …
joschu.net
http://joschu.net › docs
[PDF]
Deep Reinforcement Learning: Policy Gradients and Q-Learning
Recent Success Stories in Deep RL I ATARI using deep Q-learning4, policy gradients5, DAGGER6 I Superhuman Go using supervised learning + policy gradients + Monte Carlo tree …
joschu.net
http://joschu.net › awards.html
Awards - joschu.net
John Schulman's Homepage. Awards [2018] MIT Technology Review's 35 Innovators Under 35. [2016] C.V. Ramamoorthy Distinguished Research Award [2013] Best Vision Paper, awarded …
joschu.net
http://joschu.net › code.html
Code - joschu.net
Code. GitHub profile. Highlighted projects developed by my collaborators and me: Procgen Benchmark (2019): GitHub / blog post.; Gym Retro (2018): GitHub / blog post on dataset / …
Pagination
- 1
- 2