Find link

Find link is a tool written by Edward Betts.

Searching for "Deep reinforcement learning": 86 found (118 total)

alternate case: deep reinforcement learning

Dharshan Kumaran (187 words) exact match in snippet

more than 20,000 articles, is '"Human-level control through deep reinforcement learning" which he co-authored in 2015 with others including V Mnih, K

Q-learning (3,871 words) case mismatch in snippet

2015). "Deep Reinforcement Learning with Double Q-learning". arXiv:1509.06461 [cs.LG]. van Hasselt, Hado; Guez, Arthur; Silver, David (2015). "Deep reinforcement

Adversarial machine learning (7,938 words) exact match in snippet

by the 2-norm closely resembles Ridge regression. Adversarial deep reinforcement learning is an active area of research in reinforcement learning focusing

Ansatz (645 words) exact match in snippet

; Prati, E. (2019). "Coherent transport of quantum states by deep reinforcement learning". Communications Physics. 2 (1): 61. arXiv:1901.06603. Bibcode:2019CmPhy

Intelligent control (458 words) exact match in snippet

supposed to capture the dynamics of a system. For the control part, deep reinforcement learning has shown its ability to control complex systems. Bayesian probability

Cognitive architecture (1,251 words) case mismatch in snippet

Wierstra, Daan; Riedmiller, Martin (2013). "Playing Atari with Deep Reinforcement Learning". arXiv:1312.5602 [cs.LG]. Mnih, Volodymyr; Kavukcuoglu, Koray;

Timothy Lillicrap (911 words) case mismatch in snippet

David Silver, Daan Wierstra (2015). Continuous Control with Deep Reinforcement Learning. arXiv:1509.02971 Nicolas Heess, Jonathan J. Hunt, Timothy Lillicrap

ACM Prize in Computing (135 words) exact match in snippet

to robot learning, including learning from demonstrations and deep reinforcement learning for robotic control. 2020 Scott Aaronson For groundbreaking contributions

Denis Yarats (413 words) case mismatch in snippet

co‑authored Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels (Yarats, Kostrikov & Fergus, ICLR 2021), which introduced

Palletizer (539 words) exact match in snippet

position on the pallet. In recent years, some research has used deep reinforcement learning, where robotic agents aim to learn an optimal placement position

David Silver (computer scientist) (713 words) exact match in snippet

Silver; et al. (25 February 2015). "Human-level control through deep reinforcement learning" (PDF). Nature. 518 (7540): 529–533. doi:10.1038/NATURE14236

Dorothy Okello (1,180 words) exact match in snippet

published in the 2020 IST-Africa Conference (IST-Africa) (3) A deep reinforcement learning-based algorithm for reliability-aware multi-domain service deployment

Maluuba (1,281 words) exact match in snippet

Maluuba published a research paper learning dialogue policies with deep reinforcement learning. In 2016, Maluuba also freely released the Frames dataset, which

Lit pool (259 words) exact match in snippet

making and incentives design in the presence of a dark pool: a deep reinforcement learning approach". arXiv:1912.01129 [q-fin.MF]. Palmer, Max (2010-03-20)

Value learning (1,678 words) case mismatch in snippet

Duan, Xiaoming; He, Jianping (June 2025). "Reward Models in Deep Reinforcement Learning: A Survey". arXiv:2506.09876 [cs.RO]. "What is Value Learning

Networked-loan (3,353 words) exact match in snippet

with deep reinforcement learning integrated with high-order graph message-passing networks. It uses the framework of deep reinforcement learning to learn

Daniel Kroening (589 words) case mismatch in snippet

"Deepsynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning". AAAI 2020, Vol. 35, No. 9, pages 7647-7656. Vijay D’Silva,

RFM (market research) (879 words) case mismatch in snippet

Tkachenko, Yegor. Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space. (April 8, 2015). arXiv

Demis Hassabis (6,318 words) exact match in snippet

learning and reinforcement learning, and pioneered the field of deep reinforcement learning which combines these two methods. Hassabis has predicted that

Apprenticeship learning (1,336 words) exact match in snippet

Leike, J., Brown, T., Martic, M., Legg, S., & Amodei, D. (2017). Deep reinforcement learning from human preferences. In Advances in Neural Information Processing

Beam tilt (746 words) case mismatch in snippet

"Online Antenna Tuning in Heterogeneous Cellular Networks With Deep Reinforcement Learning". IEEE Transactions on Cognitive Communications and Networking

AC-3 algorithm (799 words) case mismatch in snippet

domain. Minh, Volodymyr (16 Jun 2016). "Asynchronous Methods for Deep Reinforcement Learning". arXiv:gr-qc/0610068. A.K. Mackworth. Consistency in networks

Imitation learning (1,339 words) case mismatch in snippet

Ahmad A. Al; Yogamani, Senthil; Perez, Patrick (June 2022). "Deep Reinforcement Learning for Autonomous Driving: A Survey". IEEE Transactions on Intelligent

DeepStack (675 words) exact match in snippet

Bakhtin, Anton; Lerer, Adam; Gong, Qucheng (2020). "Combining deep reinforcement learning and search for imperfect-information games". Advances in Neural

Henry X. Liu (1,232 words) exact match in snippet

The advancement is enabled by the development of the dense deep reinforcement learning (D2RL) approach, which allows neural networks to learn from densified

Microswimmer (14,953 words) exact match in snippet

trapped in certain flow structures by learning smart gravitaxis. Deep reinforcement learning has been used to explore microswimmer navigation problems in

Active learning (machine learning) (2,211 words) case mismatch in snippet

https://arxiv.org/abs/2303.01560v2 Learning how to Active Learn: A Deep Reinforcement Learning Approach, Meng Fang, Yuan Li, Trevor Cohn, https://arxiv.org/abs/1708

OpenAI Five (2,279 words) case mismatch in snippet

Venture Beat. Retrieved 22 April 2019. "Dota 2 with Large Scale Deep Reinforcement Learning" (PDF). OpenAI. Archived (PDF) from the original on 26 September

Michael Witbrock (716 words) case mismatch in snippet

applications. Witbrock, Michael J., Srinivas, K., Thost, V., et al. "A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving," in Proceedings

Seega (game) (902 words) case mismatch in snippet

Player for Seejeh (A.K.A Seega, Siga, Kharbga) Board Game with Deep Reinforcement Learning". Procedia Computer Science. 160: 241–247. doi:10.1016/j.procs

Dorin Comaniciu (881 words) case mismatch in snippet

Andreas; Hornegger, Joachim; Comaniciu, Dorin (2019). "Multi-Scale Deep Reinforcement Learning for Real-Time 3D-Landmark Detection in CT Scans". IEEE Transactions

Chainer (857 words) exact match in snippet

previous record held by Facebook. ChainerRL adds state of art deep reinforcement learning algorithms, and ChainerUI is a management and visualization tool

Swarm robotics (2,533 words) case mismatch in snippet

Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning" IEEE Transactions on Vehicular Technology, 2020. Hu, J.; Turgut

InterQuest Group Ltd (1,019 words) case mismatch in snippet

Conference". Retrieved 27 November 2019. "Step into the AI Era: Deep Reinforcement Learning Workshop". Retrieved 27 November 2019. "UX Sessions". Retrieved

Paul Christiano (1,221 words) case mismatch in snippet

single charity. At OpenAI, Christiano co-authored the paper "Deep Reinforcement Learning from Human Preferences" (2017) and other works developing reinforcement

Princeton Plasma Physics Laboratory (2,218 words) exact match in snippet

Egemen (2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. doi:10.1038/s41586-024-07024-9

Tearing mode (309 words) exact match in snippet

(February 2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. Bibcode:2024Natur.626..746S. doi:10

Montezuma's Revenge (video game) (1,541 words) exact match in snippet

Petersen, Stig (February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10

Peter Stone (professor) (1,023 words) exact match in snippet

Nature entitled Outracing champion Gran Turismo drivers with deep reinforcement learning, which reported on the creation of GT Sophy, a superhuman driving

Artificial intelligence (29,145 words) exact match in snippet

against four of the world's best Gran Turismo drivers using deep reinforcement learning. In 2024, Google DeepMind introduced SIMA, a type of AI capable

Customer lifetime value (2,891 words) case mismatch in snippet

Tkachenko, Yegor. Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space. (April 8, 2015). arXiv

Nested sampling algorithm (2,350 words) exact match in snippet

framework for uncertainty quantification, optimization, and deep reinforcement learning, which also implements nested sampling. Since nested sampling

Mahjong and artificial intelligence (555 words) case mismatch in snippet

Yang; Li Zhao; Tao Qin; Tie-Yan Liu; Hsiao-Wuen Hon (2020-04-01). "Suphx: Mastering Mahjong with Deep Reinforcement Learning". arXiv:2003.13590 [cs.AI].

Agent-based computational economics (1,982 words) exact match in snippet

wholesale electricity markets realistically with multi-agent deep reinforcement learning". Energy and AI. 14: 100295. doi:10.1016/j.egyai.2023.100295

Evaluation function (2,186 words) case mismatch in snippet

ICCA Journal Lai, Matthew (4 September 2015), Giraffe: Using Deep Reinforcement Learning to Play Chess, arXiv:1509.01549v1 "Neural network topology".

Gregory Dudek (1,273 words) exact match in snippet

decision-making under uncertainty, using techniques including deep reinforcement learning and probabilistic modelling. Dudek has participated in the organization

Distributional Soft Actor Critic (369 words) case mismatch in snippet

et al. (2018). "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor". ICML: 1861–1870. arXiv:1801.01290.

TD-Gammon (1,659 words) case mismatch in snippet

Wierstra, Daan; Riedmiller, Martin (2013). "Playing Atari with Deep Reinforcement Learning". arXiv:1312.5602 [cs.LG]. Silver, David; Schrittwieser, Julian;

Pushmeet Kohli (1,064 words) exact match in snippet

(February 2022). "Magnetic control of tokamak plasmas through deep reinforcement learning". Nature. 602 (7897): 414–419. Bibcode:2022Natur.602..414D. doi:10

Google Brain (4,293 words) exact match in snippet

ISSN 1941-0468. Gu, S.; Holly, E.; Lillicrap, T.; Levine, S. (May 2017). "Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates"

Superintelligence (4,700 words) case mismatch in snippet

Tom B.; Martic, Miljan; Legg, Shane; Amodei, Dario (2017). "Deep Reinforcement Learning from Human Preferences" (PDF). NeurIPS. arXiv:1706.03741. "Constitutional

Multipath TCP (2,826 words) case mismatch in snippet

in TSVWG (Transport Area Working Group) dubbed as MP-DCCP. A deep Reinforcement Learning (DRL) framework for joint congestion control and packet scheduling

Convolutional neural network (15,555 words) exact match in snippet

research described an application to Atari 2600 gaming. Other deep reinforcement learning models preceded it. Convolutional deep belief networks (CDBN)

Edward Y. Chang (2,509 words) exact match in snippet

, Chang, E. Y. (2018). Refuel: Exploring sparse features in deep reinforcement learning for fast disease diagnosis. In Advances in Neural Information

MuJoCo (314 words) case mismatch in snippet

Jorge Pena; Westerlund, Tomi (2020). "Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: A Survey". 2020 IEEE Symposium Series on Computational

AI alignment (13,069 words) case mismatch in snippet

Jacob; Krueger, David (June 28, 2022). "Goal Misgeneralization in Deep Reinforcement Learning". Proceedings of the 39th International Conference on Machine

IIT Madras (8,007 words) exact match in snippet

one of the country's largest groups in network analytics and deep reinforcement learning. Google has granted IIT Madras $1 million for setting up India's

AlphaDev (1,160 words) exact match in snippet

Silver, David (2023). "Faster sorting algorithms discovered using deep reinforcement learning". Nature. 618 (7964): 257–263. Bibcode:2023Natur.618..257M. doi:10

Language creation in artificial intelligence (1,807 words) case mismatch in snippet

Dhruv (2017). "Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning". 2017 IEEE International Conference on Computer Vision (ICCV)

Rubik's Cube (10,504 words) case mismatch in snippet

Prati (2021). "Solving Rubik's Cube via Quantum Mechanics and Deep Reinforcement Learning". Journal of Physics A: Mathematical and Theoretical. 54 (5):

Quantum machine learning (8,984 words) case mismatch in snippet

Xiaoli; Goan, Hsi-Sheng (2020). "Variational Quantum Circuits for Deep Reinforcement Learning". IEEE Access. 8: 141007–141024. arXiv:1907.00397. Bibcode:2020IEEEA

Pack hunter (6,199 words) exact match in snippet

Keisuke (2022). "Collaborative hunting in artificial agents with deep reinforcement learning". doi:10.1101/2022.10.10.511517. Tomasello, Michael; Carpenter

List of volunteer computing projects (4,267 words) exact match in snippet

2018-03-04 Software testing, chess Trains chess neural networks with deep reinforcement learning. Experiments with training parameters and net architectures No

Curriculum learning (1,389 words) exact match in snippet

Curriculum learning for heterogeneous star network embedding via deep reinforcement learning. pp. 468–476. doi:10.1145/3159652.3159711. hdl:2142/101634.

Mahjong (13,469 words) case mismatch in snippet

Hsiao-Wuen (31 March 2020). "Suphx: Mastering Mahjong with Deep Reinforcement Learning". arXiv:2003.13590 [cs.AI]. Lam, Desmond. "Chinese Gambling Superstitions

Reward hacking (1,518 words) exact match in snippet

Barth-Maron, Gabriel; Vecerik, Matej; et al. (2017). "Data-efficient deep reinforcement learning for dexterous manipulation". arXiv:1704.03073 [cs.LG]. "Learning

Hover (behaviour) (1,882 words) exact match in snippet

2023). "Exploring storm petrel pattering and sea-anchoring using deep reinforcement learning". Bioinspiration & Biomimetics. 18 (6). University of Portland

Internet of things (20,188 words) exact match in snippet

driving force for autonomous IoT. An approach in this context is deep reinforcement learning where most of IoT systems provide a dynamic and interactive environment

Tokamak (14,719 words) exact match in snippet

(February 2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. Bibcode:2024Natur.626..746S. doi:10

Federated learning (5,875 words) case mismatch in snippet

Guo, Weisi; Nallanathan, Arumugam; Wu, Qihui (2021). "Green Deep Reinforcement Learning for Radio Resource Management: Architecture, Algorithm Compression

Exploration–exploitation dilemma (1,855 words) case mismatch in snippet

[cs.LG]. Weng, Lilian (2020-06-07). "Exploration Strategies in Deep Reinforcement Learning". lilianweng.github.io. Retrieved 2024-09-15. Şimşek, Özgür;

Occupant-centric building controls (1,924 words) exact match in snippet

associated with thermal comfort and indoor air control via a deep reinforcement learning algorithm". Building and Environment. 155: 105–117. Bibcode:2019BuEnv

Timeline of computing 2020–present (23,761 words) exact match in snippet

Scaramuzza, Davide (August 2023). "Champion-level drone racing using deep reinforcement learning". Nature. 620 (7976): 982–987. Bibcode:2023Natur.620..982K. doi:10

Criticism of Google (19,859 words) case mismatch in snippet

develop a rebuttal paper titled “Stronger Baselines for Evaluating Deep Reinforcement Learning in Chip Placement,” a team effort with five other co-authors

Drones in wildfire management (5,201 words) case mismatch in snippet

Mousavi, Seyed Sajad; Schukat, Michael; Howley, Enda (2018). "Deep Reinforcement Learning: An Overview". Proceedings of SAI Intelligent Systems Conference

Applications of artificial intelligence (19,251 words) exact match in snippet

Hassabis, Demis (26 February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10

AI-driven design automation (6,349 words) exact match in snippet

from Google researchers between 2020 and 2021. They created a deep reinforcement learning method for planning the layout of a chip, known as floorplanning

Glossary of engineering: M–Z (31,185 words) case mismatch in snippet

Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning" IEEE Transactions on Vehicular Technology, 2020. Feynman, Richard

Optuna (2,789 words) case mismatch in snippet

Ahmad A. Al; Yogamani, Senthil; Pérez, Patrick (2021-02-09). "Deep Reinforcement Learning for Autonomous Driving: A Survey". IEEE Transactions on Intelligent

Alexandre M. Bayen (2,938 words) exact match in snippet

integration of microsimulation tools (SUMO and Aimsun) with early deep reinforcement learning libraries (RLlib and rllab) implemented on the cloud (AWS and

Rainbow Dash (2,577 words) exact match in snippet

"Rainbow Dash" that successfully taught itself to walk using deep reinforcement learning. The robot demonstrated the ability to learn to walk backward

2023 in science (44,594 words) exact match in snippet

Scaramuzza, Davide (August 2023). "Champion-level drone racing using deep reinforcement learning". Nature. 620 (7976): 982–987. Bibcode:2023Natur.620..982K. doi:10

AI safety (10,513 words) case mismatch in snippet

Jacob; Krueger, David (2022-06-28). "Goal Misgeneralization in Deep Reinforcement Learning". Proceedings of the 39th International Conference on Machine

Reinforcement learning from human feedback (8,617 words) case mismatch in snippet

Brown, Tom; Martic, Miljan; Legg, Shane; Amodei, Dario (2017). "Deep Reinforcement Learning from Human Preferences". Advances in Neural Information Processing

Products and applications of OpenAI (6,898 words) exact match in snippet

(MOBA) games and how OpenAI Five has demonstrated the use of deep reinforcement learning (DRL) agents to achieve superhuman competence in Dota 2 matches