Find link

language:

jump to random article

Find link is a tool written by Edward Betts.

searching for Deep reinforcement learning 84 found (113 total)

alternate case: deep reinforcement learning

Q-learning (3,835 words) [view diff] case mismatch in snippet view article find links to article

2015). "Deep Reinforcement Learning with Double Q-learning". arXiv:1509.06461 [cs.LG]. van Hasselt, Hado; Guez, Arthur; Silver, David (2015). "Deep reinforcement
Dharshan Kumaran (187 words) [view diff] exact match in snippet view article find links to article
more than 20,000 articles, is '"Human-level control through deep reinforcement learning" which he co-authored in 2015 with others including V Mnih, K
Adversarial machine learning (7,802 words) [view diff] exact match in snippet view article find links to article
by the 2-norm is equivalent to Ridge regression. Adversarial deep reinforcement learning is an active area of research in reinforcement learning focusing
Ansatz (645 words) [view diff] exact match in snippet view article find links to article
; Prati, E. (2019). "Coherent transport of quantum states by deep reinforcement learning". Communications Physics. 2 (1): 61. arXiv:1901.06603. Bibcode:2019CmPhy
Cognitive architecture (1,252 words) [view diff] case mismatch in snippet view article find links to article
Wierstra, Daan; Riedmiller, Martin (2013). "Playing Atari with Deep Reinforcement Learning". arXiv:1312.5602 [cs.LG]. Mnih, Volodymyr; Kavukcuoglu, Koray;
Timothy Lillicrap (911 words) [view diff] case mismatch in snippet view article find links to article
David Silver, Daan Wierstra (2015). Continuous Control with Deep Reinforcement Learning. arXiv:1509.02971 Nicolas Heess, Jonathan J. Hunt, Timothy Lillicrap
Intelligent control (458 words) [view diff] exact match in snippet view article find links to article
supposed to capture the dynamics of a system. For the control part, deep reinforcement learning has shown its ability to control complex systems. Bayesian probability
Baher Abdulhai (1,928 words) [view diff] exact match in snippet view article find links to article
the impacts of AVs on the capacities of highway systems. Using deep reinforcement learning and high dimensional sensory inputs, he performed a case study
ACM Prize in Computing (135 words) [view diff] exact match in snippet view article find links to article
to robot learning, including learning from demonstrations and deep reinforcement learning for robotic control. 2020 Scott Aaronson For groundbreaking contributions
Palletizer (539 words) [view diff] case mismatch in snippet view article find links to article
position on the pallet. In recent years, some research has utilized Deep Reinforcement Learning, where robotic agents aim to learn an optimal placement position
David Silver (computer scientist) (712 words) [view diff] exact match in snippet view article
Silver; et al. (25 February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. doi:10.1038/NATURE14236. ISSN 1476-4687
Dorothy Okello (1,176 words) [view diff] exact match in snippet view article find links to article
published in the 2020 IST-Africa Conference (IST-Africa) (3) A deep reinforcement learning-based algorithm for reliability-aware multi-domain service deployment
Machine learning in video games (3,884 words) [view diff] exact match in snippet view article find links to article
state of the art machine learning techniques such as relational deep reinforcement learning, long short-term memory, auto-regressive policy heads, pointer
Maluuba (1,274 words) [view diff] exact match in snippet view article find links to article
Maluuba published a research paper learning dialogue policies with deep reinforcement learning. In 2016, Maluuba also freely released the Frames dataset, which
Lit pool (259 words) [view diff] exact match in snippet view article find links to article
making and incentives design in the presence of a dark pool: a deep reinforcement learning approach". arXiv:1912.01129 [q-fin.MF]. Palmer, Max (2010-03-20)
Networked-loan (3,353 words) [view diff] exact match in snippet view article find links to article
with deep reinforcement learning integrated with high-order graph message-passing networks. It uses the framework of deep reinforcement learning to learn
Daniel Kroening (589 words) [view diff] case mismatch in snippet view article find links to article
"Deepsynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning". AAAI 2020, Vol. 35, No. 9, pages 7647-7656. Vijay D’Silva,
RFM (market research) (863 words) [view diff] case mismatch in snippet view article
Tkachenko, Yegor. Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space. (April 8, 2015). arXiv
Apprenticeship learning (1,336 words) [view diff] exact match in snippet view article find links to article
Leike, J., Brown, T., Martic, M., Legg, S., & Amodei, D. (2017). Deep reinforcement learning from human preferences. In Advances in Neural Information Processing
Beam tilt (746 words) [view diff] case mismatch in snippet view article find links to article
"Online Antenna Tuning in Heterogeneous Cellular Networks With Deep Reinforcement Learning". IEEE Transactions on Cognitive Communications and Networking
Imitation learning (1,285 words) [view diff] case mismatch in snippet view article find links to article
Ahmad A. Al; Yogamani, Senthil; Perez, Patrick (June 2022). "Deep Reinforcement Learning for Autonomous Driving: A Survey". IEEE Transactions on Intelligent
AC-3 algorithm (799 words) [view diff] case mismatch in snippet view article find links to article
domain. Minh, Volodymyr (16 Jun 2016). "Asynchronous Methods for Deep Reinforcement Learning". arXiv:gr-qc/0610068. A.K. Mackworth. Consistency in networks
Demis Hassabis (5,948 words) [view diff] exact match in snippet view article find links to article
learning and reinforcement learning, and pioneered the field of deep reinforcement learning which combines these two methods. Hassabis has predicted that
DeepStack (675 words) [view diff] exact match in snippet view article find links to article
Bakhtin, Anton; Lerer, Adam; Gong, Qucheng (2020). "Combining deep reinforcement learning and search for imperfect-information games". Advances in Neural
Henry X. Liu (1,119 words) [view diff] exact match in snippet view article find links to article
The advancement is enabled by the development of the dense deep reinforcement learning (D2RL) approach, which allows neural networks to learn from densified
Active learning (machine learning) (2,205 words) [view diff] case mismatch in snippet view article
https://arxiv.org/abs/2303.01560v2 Learning how to Active Learn: A Deep Reinforcement Learning Approach, Meng Fang, Yuan Li, Trevor Cohn, https://arxiv.org/abs/1708
Swarm robotics (2,536 words) [view diff] case mismatch in snippet view article find links to article
Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning" IEEE Transactions on Vehicular Technology, 2020. Hu, J.; Turgut
Michael Witbrock (716 words) [view diff] case mismatch in snippet view article find links to article
applications. Witbrock, Michael J., Srinivas, K., Thost, V., et al. "A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving," in Proceedings
OpenAI Five (2,260 words) [view diff] case mismatch in snippet view article find links to article
Venture Beat. Retrieved 22 April 2019. "Dota 2 with Large Scale Deep Reinforcement Learning" (PDF). OpenAI. Archived (PDF) from the original on 26 September
Microswimmer (14,757 words) [view diff] exact match in snippet view article find links to article
trapped in certain flow structures by learning smart gravitaxis. Deep reinforcement learning has been used to explore microswimmer navigation problems in
Seega (game) (905 words) [view diff] case mismatch in snippet view article
Player for Seejeh (A.K.A Seega, Siga, Kharbga) Board Game with Deep Reinforcement Learning". Procedia Computer Science. 160: 241–247. doi:10.1016/j.procs
Dorin Comaniciu (881 words) [view diff] case mismatch in snippet view article find links to article
Andreas; Hornegger, Joachim; Comaniciu, Dorin (2019). "Multi-Scale Deep Reinforcement Learning for Real-Time 3D-Landmark Detection in CT Scans". IEEE Transactions
Tearing mode (198 words) [view diff] exact match in snippet view article find links to article
(February 2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. Bibcode:2024Natur.626..746S. doi:10
Chainer (857 words) [view diff] exact match in snippet view article find links to article
previous record held by Facebook. ChainerRL adds state of art deep reinforcement learning algorithms, and ChainerUI is a management and visualization tool
Paul Christiano (1,205 words) [view diff] case mismatch in snippet view article find links to article
single charity. At OpenAI, Christiano co-authored the paper "Deep Reinforcement Learning from Human Preferences" (2017) and other works developing reinforcement
InterQuest Group Ltd (1,073 words) [view diff] case mismatch in snippet view article find links to article
Conference". Retrieved 27 November 2019. "Step into the AI Era: Deep Reinforcement Learning Workshop". Retrieved 27 November 2019. "UX Sessions". Retrieved
Princeton Plasma Physics Laboratory (2,158 words) [view diff] exact match in snippet view article find links to article
Egemen (2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. doi:10.1038/s41586-024-07024-9
Montezuma's Revenge (video game) (1,398 words) [view diff] exact match in snippet view article
Petersen, Stig (February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10
Peter Stone (professor) (1,023 words) [view diff] exact match in snippet view article
Nature entitled Outracing champion Gran Turismo drivers with deep reinforcement learning, which reported on the creation of GT Sophy, a superhuman driving
Mahjong and artificial intelligence (555 words) [view diff] case mismatch in snippet view article find links to article
Yang; Li Zhao; Tao Qin; Tie-Yan Liu; Hsiao-Wuen Hon (2020-04-01). "Suphx: Mastering Mahjong with Deep Reinforcement Learning". arXiv:2003.13590 [cs.AI].
Artificial intelligence (28,540 words) [view diff] exact match in snippet view article find links to article
against four of the world's best Gran Turismo drivers using deep reinforcement learning. In 2024, Google DeepMind introduced SIMA, a type of AI capable
Nested sampling algorithm (2,266 words) [view diff] exact match in snippet view article find links to article
framework for uncertainty quantification, optimization, and deep reinforcement learning, which also implements nested sampling. Since nested sampling
Customer lifetime value (2,890 words) [view diff] case mismatch in snippet view article find links to article
Tkachenko, Yegor. Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space. (April 8, 2015). arXiv
Distributional Soft Actor Critic (369 words) [view diff] case mismatch in snippet view article find links to article
et al. (2018). "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor". ICML: 1861–1870. arXiv:1801.01290.
Gregory Dudek (1,273 words) [view diff] exact match in snippet view article find links to article
decision-making under uncertainty, using techniques including deep reinforcement learning and probabilistic modelling. Dudek has participated in the organization
Agent-based computational economics (1,860 words) [view diff] exact match in snippet view article find links to article
wholesale electricity markets realistically with multi-agent deep reinforcement learning". Energy and AI. 14: 100295. doi:10.1016/j.egyai.2023.100295
Evaluation function (2,436 words) [view diff] case mismatch in snippet view article find links to article
ICCA Journal Lai, Matthew (4 September 2015), Giraffe: Using Deep Reinforcement Learning to Play Chess, arXiv:1509.01549v1 "Neural network topology".
Vehicular ad hoc network (3,307 words) [view diff] exact match in snippet view article find links to article
Alhussan, Amel Ali; Khafaga, Doaa Sami (2023). "An improved deep reinforcement learning routing technique for collision-free VANET". Scientific Reports
Google Brain (4,223 words) [view diff] exact match in snippet view article find links to article
ISSN 1941-0468. Gu, S.; Holly, E.; Lillicrap, T.; Levine, S. (May 2017). "Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates"
Pushmeet Kohli (1,060 words) [view diff] exact match in snippet view article find links to article
(February 2022). "Magnetic control of tokamak plasmas through deep reinforcement learning". Nature. 602 (7897): 414–419. Bibcode:2022Natur.602..414D. doi:10
Superintelligence (4,472 words) [view diff] case mismatch in snippet view article find links to article
Tom B.; Martic, Miljan; Legg, Shane; Amodei, Dario (2017). "Deep Reinforcement Learning from Human Preferences" (PDF). NeurIPS. arXiv:1706.03741. "Constitutional
Convolutional neural network (15,593 words) [view diff] exact match in snippet view article find links to article
research described an application to Atari 2600 gaming. Other deep reinforcement learning models preceded it. Convolutional deep belief networks (CDBN)
Multipath TCP (2,819 words) [view diff] case mismatch in snippet view article find links to article
congestion control algorithm The Balanced Linked Increase Algorithm A deep Reinforcement Learning (DRL) framework for joint congestion control and packet scheduling
Edward Y. Chang (2,570 words) [view diff] exact match in snippet view article find links to article
, Chang, E. Y. (2018). Refuel: Exploring sparse features in deep reinforcement learning for fast disease diagnosis. In Advances in Neural Information
MuJoCo (314 words) [view diff] case mismatch in snippet view article find links to article
Jorge Pena; Westerlund, Tomi (2020). "Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: A Survey". 2020 IEEE Symposium Series on Computational
AI alignment (12,973 words) [view diff] case mismatch in snippet view article find links to article
Jacob; Krueger, David (June 28, 2022). "Goal Misgeneralization in Deep Reinforcement Learning". Proceedings of the 39th International Conference on Machine
IIT Madras (8,259 words) [view diff] exact match in snippet view article find links to article
one of the country's largest groups in network analytics and deep reinforcement learning. Google has granted IIT Madras $1 million for setting up India's
Machine learning (15,339 words) [view diff] case mismatch in snippet view article find links to article
Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning". IEEE Transactions on Vehicular Technology. 69 (12): 14413–14423
AlphaDev (1,160 words) [view diff] exact match in snippet view article find links to article
Silver, David (2023). "Faster sorting algorithms discovered using deep reinforcement learning". Nature. 618 (7964): 257–263. Bibcode:2023Natur.618..257M. doi:10
Rubik's Cube (10,324 words) [view diff] case mismatch in snippet view article find links to article
Prati (2021). "Solving Rubik's Cube via Quantum Mechanics and Deep Reinforcement Learning". Journal of Physics A: Mathematical and Theoretical. 54 (5):
Pack hunter (6,192 words) [view diff] exact match in snippet view article find links to article
Keisuke (2022). "Collaborative hunting in artificial agents with deep reinforcement learning". doi:10.1101/2022.10.10.511517. Tomasello, Michael; Carpenter
OpenAI (19,509 words) [view diff] exact match in snippet view article find links to article
(MOBA) games and how OpenAI Five has demonstrated the use of deep reinforcement learning (DRL) agents to achieve superhuman competence in Dota 2 matches
Language creation in artificial intelligence (1,821 words) [view diff] case mismatch in snippet view article find links to article
Batra, D. (2017). Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning. arXiv:1703.06585. Johnson, Melvin; Schuster, Mike; Le, Quoc
List of volunteer computing projects (4,267 words) [view diff] exact match in snippet view article find links to article
2018-03-04 Software testing, chess Trains chess neural networks with deep reinforcement learning. Experiments with training parameters and net architectures No
Curriculum learning (1,367 words) [view diff] exact match in snippet view article find links to article
Curriculum learning for heterogeneous star network embedding via deep reinforcement learning. pp. 468–476. doi:10.1145/3159652.3159711. hdl:2142/101634.
Internet of things (20,089 words) [view diff] exact match in snippet view article find links to article
driving force for autonomous IoT. An approach in this context is deep reinforcement learning where most of IoT systems provide a dynamic and interactive environment
Mahjong (13,178 words) [view diff] case mismatch in snippet view article find links to article
Hsiao-Wuen (31 March 2020). "Suphx: Mastering Mahjong with Deep Reinforcement Learning". arXiv:2003.13590 [cs.AI]. Lam, Desmond. "Chinese Gambling Superstitions
Quantum machine learning (10,780 words) [view diff] case mismatch in snippet view article find links to article
Xiaoli; Goan, Hsi-Sheng (2020). "Variational Quantum Circuits for Deep Reinforcement Learning". IEEE Access. 8: 141007–141024. arXiv:1907.00397. Bibcode:2020IEEEA
Tokamak (14,234 words) [view diff] exact match in snippet view article find links to article
(February 2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. Bibcode:2024Natur.626..746S. doi:10
Reward hacking (1,510 words) [view diff] exact match in snippet view article find links to article
Barth-Maron, Gabriel; Vecerik, Matej; et al. (2017). "Data-efficient deep reinforcement learning for dexterous manipulation". arXiv:1704.03073 [cs.LG]. "Learning
Hover (behaviour) (1,811 words) [view diff] exact match in snippet view article
2023). "Exploring storm petrel pattering and sea-anchoring using deep reinforcement learning". Bioinspiration & Biomimetics. 18 (6). University of Portland
Big Five personality traits (20,389 words) [view diff] case mismatch in snippet view article find links to article
"Reining in Long Consumer Questionnaires with Self-Supervised Deep Reinforcement Learning" (PDF). Wharton JMP. Goldberg LR (December 1990). "An alternative
Federated learning (5,892 words) [view diff] case mismatch in snippet view article find links to article
Guo, Weisi; Nallanathan, Arumugam; Wu, Qihui (2021). "Green Deep Reinforcement Learning for Radio Resource Management: Architecture, Algorithm Compression
Exploration–exploitation dilemma (1,855 words) [view diff] case mismatch in snippet view article find links to article
[cs.LG]. Weng, Lilian (2020-06-07). "Exploration Strategies in Deep Reinforcement Learning". lilianweng.github.io. Retrieved 2024-09-15. Şimşek, Özgür;
Occupant-centric building controls (1,924 words) [view diff] exact match in snippet view article find links to article
associated with thermal comfort and indoor air control via a deep reinforcement learning algorithm". Building and Environment. 155: 105–117. Bibcode:2019BuEnv
Timeline of computing 2020–present (23,765 words) [view diff] exact match in snippet view article find links to article
Scaramuzza, Davide (August 2023). "Champion-level drone racing using deep reinforcement learning". Nature. 620 (7976): 982–987. Bibcode:2023Natur.620..982K. doi:10
Drones in wildfire management (4,523 words) [view diff] case mismatch in snippet view article find links to article
Mousavi, Seyed Sajad; Schukat, Michael; Howley, Enda (2018). "Deep Reinforcement Learning: An Overview". Proceedings of SAI Intelligent Systems Conference
Applications of artificial intelligence (19,806 words) [view diff] exact match in snippet view article find links to article
Hassabis, Demis (26 February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10
Rainbow Dash (1,470 words) [view diff] exact match in snippet view article find links to article
"Rainbow Dash" that successfully taught itself to walk using deep reinforcement learning. The robot demonstrated the ability to learn to walk backward
Glossary of engineering: M–Z (31,124 words) [view diff] case mismatch in snippet view article find links to article
Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning" IEEE Transactions on Vehicular Technology, 2020. Feynman, Richard
Alexandre M. Bayen (2,843 words) [view diff] exact match in snippet view article find links to article
integration of microsimulation tools (SUMO and Aimsun) with early deep reinforcement learning libraries (RLlib and rllab) implemented on the cloud (AWS and
2023 in science (44,593 words) [view diff] exact match in snippet view article find links to article
Scaramuzza, Davide (August 2023). "Champion-level drone racing using deep reinforcement learning". Nature. 620 (7976): 982–987. Bibcode:2023Natur.620..982K. doi:10
AI safety (10,322 words) [view diff] case mismatch in snippet view article find links to article
Jacob; Krueger, David (2022-06-28). "Goal Misgeneralization in Deep Reinforcement Learning". Proceedings of the 39th International Conference on Machine
Reinforcement learning from human feedback (8,661 words) [view diff] case mismatch in snippet view article find links to article
Brown, Tom; Martic, Miljan; Legg, Shane; Amodei, Dario (2017). "Deep Reinforcement Learning from Human Preferences". Advances in Neural Information Processing