原创 從C++和OCaml程序員的視角看Rust (Part 2)

本文來源地址 SEPTEMBER 29, 2017 by GUILLAUME ENDIGNOUX @GEndignoux This post is the second of my series about Rust compared to

原创 從C++和OCaml程序員的視角看Rust (Part 1)

本文來源地址 SEPTEMBER 5, 2017 by GUILLAUME ENDIGNOUX @GEndignoux This summer, I decided to have a look at Rust, the new progr

原创 Jaakko Hintikka (1929-2015)

Finnish logician and philosopher Jaakko Hintikka died at the age of 86 after a brief illness on August 12, 2015. Jaakko

原创 Soft Actor-Critic Algorithms and Applications

Tuomas Haarnoja, Aurick Zhou, Kristian Hartikainen, George Tucker, Sehoon Ha, Jie Tan, Vikash Kumar, Henry Zhu, Abhishe

原创 Autoregressive Energy Machines

Charlie Nash, Conor Durkan Abstract Neural density estimators are flexible families of parametric models which have se

原创 PapeRman #6

本文描述了一個新的推斷智能體動機的方法。該方法基於影響圖,這是一種圖模型的類型,包含特別的決策和效用節點。圖標準可以被用來確智能體觀測動機和智能體干預動機** Understanding Agent Incentives using Cau

原创 Dynamic Sampling from Graphical Models

Dynamic Sampling from Graphical Models Weiming Feng, Nisheeth K. Vishnoi, Yitong Yin Abstract In this paper, we study

原创 Statistics and Samples in Distributional Reinforcement Learning

Statistics and Samples in Distributional Reinforcement Learning Mark Rowland, Robert Dadashi, Saurabh Kumar, Rémi Munos,

原创 From Language to Goals: Inverse Reinforcement Learning for Vision-Based Instruction Following

From Language to Goals: Inverse Reinforcement Learning for Vision-Based Instruction Following Justin Fu, Anoop Korattika

原创 Environment-Independent Task Specifications via GLTL

Environment-Independent Task Specifications via GLTL Michael L. Littman, Ufuk Topcu, Jie Fu, Charles Isbell, Min Wen, Ja

原创 Programmable Agents

Programmable Agents Misha Denil, Sergio Gómez Colmenarejo, Serkan Cabi, David Saxton, Nando de Freitas (Submitted on 20

原创 More Adaptive Algorithms for Adversarial Bandits

More Adaptive Algorithms for Adversarial Bandits Authors: Chen-Yu Wei, Haipeng Luo Institute: University of Southern Cal

原创 EMERGENT COORDINATION THROUGH COMPETITION

EMERGENT COORDINATION THROUGH COMPETITION Authors: Siqi Liu, Guy Lever, Josh Merel, Saran Tunyasuvunakool, Nicolas Heess

原创 PapeRman #5

對抗健壯性的研究非常具有挑戰性。在衆多研究方向中,存在一些相應的進展。本篇論文是一個較清楚的整理,有助於大家更好地理解對抗網絡的工作機制。 On Evaluating Adversarial Robustness Authors: Nich

原创 Goodhart's original formulation

Any observed statistical regularity will tend to collapse once pressure is placed upon it for control purposes. Mario