language:
Find link is a tool written by Edward Betts.searching for Deep reinforcement learning 84 found (113 total)
alternate case: deep reinforcement learning
Q-learning
(3,835 words)
[view diff]
case mismatch in snippet
view article
find links to article
2015). "Deep Reinforcement Learning with Double Q-learning". arXiv:1509.06461 [cs.LG]. van Hasselt, Hado; Guez, Arthur; Silver, David (2015). "Deep reinforcementDharshan Kumaran (187 words) [view diff] exact match in snippet view article find links to article
more than 20,000 articles, is '"Human-level control through deep reinforcement learning" which he co-authored in 2015 with others including V Mnih, KAdversarial machine learning (7,802 words) [view diff] exact match in snippet view article find links to article
by the 2-norm is equivalent to Ridge regression. Adversarial deep reinforcement learning is an active area of research in reinforcement learning focusingAnsatz (645 words) [view diff] exact match in snippet view article find links to article
; Prati, E. (2019). "Coherent transport of quantum states by deep reinforcement learning". Communications Physics. 2 (1): 61. arXiv:1901.06603. Bibcode:2019CmPhyCognitive architecture (1,252 words) [view diff] case mismatch in snippet view article find links to article
Wierstra, Daan; Riedmiller, Martin (2013). "Playing Atari with Deep Reinforcement Learning". arXiv:1312.5602 [cs.LG]. Mnih, Volodymyr; Kavukcuoglu, Koray;Timothy Lillicrap (911 words) [view diff] case mismatch in snippet view article find links to article
David Silver, Daan Wierstra (2015). Continuous Control with Deep Reinforcement Learning. arXiv:1509.02971 Nicolas Heess, Jonathan J. Hunt, Timothy LillicrapIntelligent control (458 words) [view diff] exact match in snippet view article find links to article
supposed to capture the dynamics of a system. For the control part, deep reinforcement learning has shown its ability to control complex systems. Bayesian probabilityBaher Abdulhai (1,928 words) [view diff] exact match in snippet view article find links to article
the impacts of AVs on the capacities of highway systems. Using deep reinforcement learning and high dimensional sensory inputs, he performed a case studyACM Prize in Computing (135 words) [view diff] exact match in snippet view article find links to article
to robot learning, including learning from demonstrations and deep reinforcement learning for robotic control. 2020 Scott Aaronson For groundbreaking contributionsPalletizer (539 words) [view diff] case mismatch in snippet view article find links to article
position on the pallet. In recent years, some research has utilized Deep Reinforcement Learning, where robotic agents aim to learn an optimal placement positionDavid Silver (computer scientist) (712 words) [view diff] exact match in snippet view article
Silver; et al. (25 February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. doi:10.1038/NATURE14236. ISSN 1476-4687Dorothy Okello (1,176 words) [view diff] exact match in snippet view article find links to article
published in the 2020 IST-Africa Conference (IST-Africa) (3) A deep reinforcement learning-based algorithm for reliability-aware multi-domain service deploymentMachine learning in video games (3,884 words) [view diff] exact match in snippet view article find links to article
state of the art machine learning techniques such as relational deep reinforcement learning, long short-term memory, auto-regressive policy heads, pointerMaluuba (1,274 words) [view diff] exact match in snippet view article find links to article
Maluuba published a research paper learning dialogue policies with deep reinforcement learning. In 2016, Maluuba also freely released the Frames dataset, whichLit pool (259 words) [view diff] exact match in snippet view article find links to article
making and incentives design in the presence of a dark pool: a deep reinforcement learning approach". arXiv:1912.01129 [q-fin.MF]. Palmer, Max (2010-03-20)Networked-loan (3,353 words) [view diff] exact match in snippet view article find links to article
with deep reinforcement learning integrated with high-order graph message-passing networks. It uses the framework of deep reinforcement learning to learnDaniel Kroening (589 words) [view diff] case mismatch in snippet view article find links to article
"Deepsynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning". AAAI 2020, Vol. 35, No. 9, pages 7647-7656. Vijay D’Silva,RFM (market research) (863 words) [view diff] case mismatch in snippet view article
Tkachenko, Yegor. Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space. (April 8, 2015). arXivApprenticeship learning (1,336 words) [view diff] exact match in snippet view article find links to article
Leike, J., Brown, T., Martic, M., Legg, S., & Amodei, D. (2017). Deep reinforcement learning from human preferences. In Advances in Neural Information ProcessingBeam tilt (746 words) [view diff] case mismatch in snippet view article find links to article
"Online Antenna Tuning in Heterogeneous Cellular Networks With Deep Reinforcement Learning". IEEE Transactions on Cognitive Communications and NetworkingImitation learning (1,285 words) [view diff] case mismatch in snippet view article find links to article
Ahmad A. Al; Yogamani, Senthil; Perez, Patrick (June 2022). "Deep Reinforcement Learning for Autonomous Driving: A Survey". IEEE Transactions on IntelligentAC-3 algorithm (799 words) [view diff] case mismatch in snippet view article find links to article
domain. Minh, Volodymyr (16 Jun 2016). "Asynchronous Methods for Deep Reinforcement Learning". arXiv:gr-qc/0610068. A.K. Mackworth. Consistency in networksDemis Hassabis (5,948 words) [view diff] exact match in snippet view article find links to article
learning and reinforcement learning, and pioneered the field of deep reinforcement learning which combines these two methods. Hassabis has predicted thatDeepStack (675 words) [view diff] exact match in snippet view article find links to article
Bakhtin, Anton; Lerer, Adam; Gong, Qucheng (2020). "Combining deep reinforcement learning and search for imperfect-information games". Advances in NeuralHenry X. Liu (1,119 words) [view diff] exact match in snippet view article find links to article
The advancement is enabled by the development of the dense deep reinforcement learning (D2RL) approach, which allows neural networks to learn from densifiedActive learning (machine learning) (2,205 words) [view diff] case mismatch in snippet view article
https://arxiv.org/abs/2303.01560v2 Learning how to Active Learn: A Deep Reinforcement Learning Approach, Meng Fang, Yuan Li, Trevor Cohn, https://arxiv.org/abs/1708Swarm robotics (2,536 words) [view diff] case mismatch in snippet view article find links to article
Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning" IEEE Transactions on Vehicular Technology, 2020. Hu, J.; TurgutMichael Witbrock (716 words) [view diff] case mismatch in snippet view article find links to article
applications. Witbrock, Michael J., Srinivas, K., Thost, V., et al. "A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving," in ProceedingsOpenAI Five (2,260 words) [view diff] case mismatch in snippet view article find links to article
Venture Beat. Retrieved 22 April 2019. "Dota 2 with Large Scale Deep Reinforcement Learning" (PDF). OpenAI. Archived (PDF) from the original on 26 SeptemberMicroswimmer (14,757 words) [view diff] exact match in snippet view article find links to article
trapped in certain flow structures by learning smart gravitaxis. Deep reinforcement learning has been used to explore microswimmer navigation problems inSeega (game) (905 words) [view diff] case mismatch in snippet view article
Player for Seejeh (A.K.A Seega, Siga, Kharbga) Board Game with Deep Reinforcement Learning". Procedia Computer Science. 160: 241–247. doi:10.1016/j.procsDorin Comaniciu (881 words) [view diff] case mismatch in snippet view article find links to article
Andreas; Hornegger, Joachim; Comaniciu, Dorin (2019). "Multi-Scale Deep Reinforcement Learning for Real-Time 3D-Landmark Detection in CT Scans". IEEE TransactionsTearing mode (198 words) [view diff] exact match in snippet view article find links to article
(February 2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. Bibcode:2024Natur.626..746S. doi:10Chainer (857 words) [view diff] exact match in snippet view article find links to article
previous record held by Facebook. ChainerRL adds state of art deep reinforcement learning algorithms, and ChainerUI is a management and visualization toolPaul Christiano (1,205 words) [view diff] case mismatch in snippet view article find links to article
single charity. At OpenAI, Christiano co-authored the paper "Deep Reinforcement Learning from Human Preferences" (2017) and other works developing reinforcementInterQuest Group Ltd (1,073 words) [view diff] case mismatch in snippet view article find links to article
Conference". Retrieved 27 November 2019. "Step into the AI Era: Deep Reinforcement Learning Workshop". Retrieved 27 November 2019. "UX Sessions". RetrievedPrinceton Plasma Physics Laboratory (2,158 words) [view diff] exact match in snippet view article find links to article
Egemen (2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. doi:10.1038/s41586-024-07024-9Montezuma's Revenge (video game) (1,398 words) [view diff] exact match in snippet view article
Petersen, Stig (February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10Peter Stone (professor) (1,023 words) [view diff] exact match in snippet view article
Nature entitled Outracing champion Gran Turismo drivers with deep reinforcement learning, which reported on the creation of GT Sophy, a superhuman drivingMahjong and artificial intelligence (555 words) [view diff] case mismatch in snippet view article find links to article
Yang; Li Zhao; Tao Qin; Tie-Yan Liu; Hsiao-Wuen Hon (2020-04-01). "Suphx: Mastering Mahjong with Deep Reinforcement Learning". arXiv:2003.13590 [cs.AI].Artificial intelligence (28,540 words) [view diff] exact match in snippet view article find links to article
against four of the world's best Gran Turismo drivers using deep reinforcement learning. In 2024, Google DeepMind introduced SIMA, a type of AI capableNested sampling algorithm (2,266 words) [view diff] exact match in snippet view article find links to article
framework for uncertainty quantification, optimization, and deep reinforcement learning, which also implements nested sampling. Since nested samplingCustomer lifetime value (2,890 words) [view diff] case mismatch in snippet view article find links to article
Tkachenko, Yegor. Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space. (April 8, 2015). arXivDistributional Soft Actor Critic (369 words) [view diff] case mismatch in snippet view article find links to article
et al. (2018). "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor". ICML: 1861–1870. arXiv:1801.01290.Gregory Dudek (1,273 words) [view diff] exact match in snippet view article find links to article
decision-making under uncertainty, using techniques including deep reinforcement learning and probabilistic modelling. Dudek has participated in the organizationAgent-based computational economics (1,860 words) [view diff] exact match in snippet view article find links to article
wholesale electricity markets realistically with multi-agent deep reinforcement learning". Energy and AI. 14: 100295. doi:10.1016/j.egyai.2023.100295Evaluation function (2,436 words) [view diff] case mismatch in snippet view article find links to article
ICCA Journal Lai, Matthew (4 September 2015), Giraffe: Using Deep Reinforcement Learning to Play Chess, arXiv:1509.01549v1 "Neural network topology".Vehicular ad hoc network (3,307 words) [view diff] exact match in snippet view article find links to article
Alhussan, Amel Ali; Khafaga, Doaa Sami (2023). "An improved deep reinforcement learning routing technique for collision-free VANET". Scientific ReportsGoogle Brain (4,223 words) [view diff] exact match in snippet view article find links to article
ISSN 1941-0468. Gu, S.; Holly, E.; Lillicrap, T.; Levine, S. (May 2017). "Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates"Pushmeet Kohli (1,060 words) [view diff] exact match in snippet view article find links to article
(February 2022). "Magnetic control of tokamak plasmas through deep reinforcement learning". Nature. 602 (7897): 414–419. Bibcode:2022Natur.602..414D. doi:10Superintelligence (4,472 words) [view diff] case mismatch in snippet view article find links to article
Tom B.; Martic, Miljan; Legg, Shane; Amodei, Dario (2017). "Deep Reinforcement Learning from Human Preferences" (PDF). NeurIPS. arXiv:1706.03741. "ConstitutionalConvolutional neural network (15,593 words) [view diff] exact match in snippet view article find links to article
research described an application to Atari 2600 gaming. Other deep reinforcement learning models preceded it. Convolutional deep belief networks (CDBN)Multipath TCP (2,819 words) [view diff] case mismatch in snippet view article find links to article
congestion control algorithm The Balanced Linked Increase Algorithm A deep Reinforcement Learning (DRL) framework for joint congestion control and packet schedulingEdward Y. Chang (2,570 words) [view diff] exact match in snippet view article find links to article
, Chang, E. Y. (2018). Refuel: Exploring sparse features in deep reinforcement learning for fast disease diagnosis. In Advances in Neural InformationMuJoCo (314 words) [view diff] case mismatch in snippet view article find links to article
Jorge Pena; Westerlund, Tomi (2020). "Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: A Survey". 2020 IEEE Symposium Series on ComputationalAI alignment (12,973 words) [view diff] case mismatch in snippet view article find links to article
Jacob; Krueger, David (June 28, 2022). "Goal Misgeneralization in Deep Reinforcement Learning". Proceedings of the 39th International Conference on MachineIIT Madras (8,259 words) [view diff] exact match in snippet view article find links to article
one of the country's largest groups in network analytics and deep reinforcement learning. Google has granted IIT Madras $1 million for setting up India'sMachine learning (15,339 words) [view diff] case mismatch in snippet view article find links to article
Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning". IEEE Transactions on Vehicular Technology. 69 (12): 14413–14423AlphaDev (1,160 words) [view diff] exact match in snippet view article find links to article
Silver, David (2023). "Faster sorting algorithms discovered using deep reinforcement learning". Nature. 618 (7964): 257–263. Bibcode:2023Natur.618..257M. doi:10Rubik's Cube (10,324 words) [view diff] case mismatch in snippet view article find links to article
Prati (2021). "Solving Rubik's Cube via Quantum Mechanics and Deep Reinforcement Learning". Journal of Physics A: Mathematical and Theoretical. 54 (5):Pack hunter (6,192 words) [view diff] exact match in snippet view article find links to article
Keisuke (2022). "Collaborative hunting in artificial agents with deep reinforcement learning". doi:10.1101/2022.10.10.511517. Tomasello, Michael; CarpenterOpenAI (19,509 words) [view diff] exact match in snippet view article find links to article
(MOBA) games and how OpenAI Five has demonstrated the use of deep reinforcement learning (DRL) agents to achieve superhuman competence in Dota 2 matchesLanguage creation in artificial intelligence (1,821 words) [view diff] case mismatch in snippet view article find links to article
Batra, D. (2017). Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning. arXiv:1703.06585. Johnson, Melvin; Schuster, Mike; Le, QuocList of volunteer computing projects (4,267 words) [view diff] exact match in snippet view article find links to article
2018-03-04 Software testing, chess Trains chess neural networks with deep reinforcement learning. Experiments with training parameters and net architectures NoCurriculum learning (1,367 words) [view diff] exact match in snippet view article find links to article
Curriculum learning for heterogeneous star network embedding via deep reinforcement learning. pp. 468–476. doi:10.1145/3159652.3159711. hdl:2142/101634.Internet of things (20,089 words) [view diff] exact match in snippet view article find links to article
driving force for autonomous IoT. An approach in this context is deep reinforcement learning where most of IoT systems provide a dynamic and interactive environmentMahjong (13,178 words) [view diff] case mismatch in snippet view article find links to article
Hsiao-Wuen (31 March 2020). "Suphx: Mastering Mahjong with Deep Reinforcement Learning". arXiv:2003.13590 [cs.AI]. Lam, Desmond. "Chinese Gambling SuperstitionsQuantum machine learning (10,780 words) [view diff] case mismatch in snippet view article find links to article
Xiaoli; Goan, Hsi-Sheng (2020). "Variational Quantum Circuits for Deep Reinforcement Learning". IEEE Access. 8: 141007–141024. arXiv:1907.00397. Bibcode:2020IEEEATokamak (14,234 words) [view diff] exact match in snippet view article find links to article
(February 2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. Bibcode:2024Natur.626..746S. doi:10Reward hacking (1,510 words) [view diff] exact match in snippet view article find links to article
Barth-Maron, Gabriel; Vecerik, Matej; et al. (2017). "Data-efficient deep reinforcement learning for dexterous manipulation". arXiv:1704.03073 [cs.LG]. "LearningHover (behaviour) (1,811 words) [view diff] exact match in snippet view article
2023). "Exploring storm petrel pattering and sea-anchoring using deep reinforcement learning". Bioinspiration & Biomimetics. 18 (6). University of PortlandBig Five personality traits (20,389 words) [view diff] case mismatch in snippet view article find links to article
"Reining in Long Consumer Questionnaires with Self-Supervised Deep Reinforcement Learning" (PDF). Wharton JMP. Goldberg LR (December 1990). "An alternativeFederated learning (5,892 words) [view diff] case mismatch in snippet view article find links to article
Guo, Weisi; Nallanathan, Arumugam; Wu, Qihui (2021). "Green Deep Reinforcement Learning for Radio Resource Management: Architecture, Algorithm CompressionExploration–exploitation dilemma (1,855 words) [view diff] case mismatch in snippet view article find links to article
[cs.LG]. Weng, Lilian (2020-06-07). "Exploration Strategies in Deep Reinforcement Learning". lilianweng.github.io. Retrieved 2024-09-15. Şimşek, Özgür;Occupant-centric building controls (1,924 words) [view diff] exact match in snippet view article find links to article
associated with thermal comfort and indoor air control via a deep reinforcement learning algorithm". Building and Environment. 155: 105–117. Bibcode:2019BuEnvTimeline of computing 2020–present (23,765 words) [view diff] exact match in snippet view article find links to article
Scaramuzza, Davide (August 2023). "Champion-level drone racing using deep reinforcement learning". Nature. 620 (7976): 982–987. Bibcode:2023Natur.620..982K. doi:10Drones in wildfire management (4,523 words) [view diff] case mismatch in snippet view article find links to article
Mousavi, Seyed Sajad; Schukat, Michael; Howley, Enda (2018). "Deep Reinforcement Learning: An Overview". Proceedings of SAI Intelligent Systems ConferenceApplications of artificial intelligence (19,806 words) [view diff] exact match in snippet view article find links to article
Hassabis, Demis (26 February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10Rainbow Dash (1,470 words) [view diff] exact match in snippet view article find links to article
"Rainbow Dash" that successfully taught itself to walk using deep reinforcement learning. The robot demonstrated the ability to learn to walk backwardGlossary of engineering: M–Z (31,124 words) [view diff] case mismatch in snippet view article find links to article
Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning" IEEE Transactions on Vehicular Technology, 2020. Feynman, RichardAlexandre M. Bayen (2,843 words) [view diff] exact match in snippet view article find links to article
integration of microsimulation tools (SUMO and Aimsun) with early deep reinforcement learning libraries (RLlib and rllab) implemented on the cloud (AWS and2023 in science (44,593 words) [view diff] exact match in snippet view article find links to article
Scaramuzza, Davide (August 2023). "Champion-level drone racing using deep reinforcement learning". Nature. 620 (7976): 982–987. Bibcode:2023Natur.620..982K. doi:10AI safety (10,322 words) [view diff] case mismatch in snippet view article find links to article
Jacob; Krueger, David (2022-06-28). "Goal Misgeneralization in Deep Reinforcement Learning". Proceedings of the 39th International Conference on MachineReinforcement learning from human feedback (8,661 words) [view diff] case mismatch in snippet view article find links to article
Brown, Tom; Martic, Miljan; Legg, Shane; Amodei, Dario (2017). "Deep Reinforcement Learning from Human Preferences". Advances in Neural Information Processing