<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE ArticleSet PUBLIC "-//NLM//DTD PubMed 2.7//EN" "https://dtd.nlm.nih.gov/ncbi/pubmed/in/PubMed.dtd">
<ArticleSet>
<Article>
<Journal>
				<PublisherName>Amirkabir University of Technology</PublisherName>
				<JournalTitle>AUT Journal of Mechanical Engineering</JournalTitle>
				<Issn>2588-2937</Issn>
				<Volume>10</Volume>
				<Issue>1</Issue>
				<PubDate PubStatus="epublish">
					<Year>2026</Year>
					<Month>01</Month>
					<Day>01</Day>
				</PubDate>
			</Journal>
<ArticleTitle>Towards Reliable Deep Reinforcement Learning for Industrial Applications: A DDPG-based Algorithm with Improved Performance</ArticleTitle>
<VernacularTitle></VernacularTitle>
			<FirstPage>61</FirstPage>
			<LastPage>74</LastPage>
			<ELocationID EIdType="pii">5842</ELocationID>
			
<ELocationID EIdType="doi">10.22060/ajme.2025.24180.6181</ELocationID>
			
			<Language>EN</Language>
<AuthorList>
<Author>
					<FirstName>Mahdi</FirstName>
					<LastName>Dolati</LastName>
<Affiliation>Department of Electrical Engineering, Faculty of Engineering, University of Isfahan, Isfahan, Iran</Affiliation>

</Author>
<Author>
					<FirstName>Negin</FirstName>
					<LastName>Sayyaf</LastName>
<Affiliation>Department of Electrical Engineering, Faculty of Engineering, University of Isfahan, Isfahan, Iran</Affiliation>
<Identifier Source="ORCID">0000-0002-1315-5250</Identifier>

</Author>
</AuthorList>
				<PublicationType>Journal Article</PublicationType>
			<History>
				<PubDate PubStatus="received">
					<Year>2025</Year>
					<Month>05</Month>
					<Day>21</Day>
				</PubDate>
			</History>
		<Abstract>This paper proposes an Improved Model-Based Deep Deterministic Policy Gradient, a novel reinforcement learning algorithm designed to overcome three critical challenges in industrial deep reinforcement learning applications: (1) poor sample efficiency requiring excessive real-world trials, (2) safety risks from unstable policies during training, and (3) difficulty scaling to high-dimensional continuous control spaces. Building on DDPG&#039;s strengths for continuous control, the proposed algorithm introduces four key innovations: (i) a virtual environment for data-efficient learning, (ii) a simulation rate mechanism adapting model reliance dynamically, (iii) a simulated experience buffer preventing divergence, and (iv) a performance threshold for fail-safe operation. Evaluated on the Cart-Pole benchmark via the OpenAI Gym Python library, the suggested method demonstrates faster convergence than standard DDPG while maintaining performance degradation under sensor malfunctions or communication losses. These improvements derive from the algorithm&#039;s unique ability to simultaneously leverage real-world data and model-generated experiences, reducing physical trial costs while ensuring operational safety. The results establish the novel framework as a practical solution for industrial control systems where reliability and data efficiency are paramount, particularly in applications like chemical process control and precision robotics that demand stable operation amid sensor/communication failures.</Abstract>
		<ObjectList>
			<Object Type="keyword">
			<Param Name="value">deep reinforcement learning</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Model-Based Method</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Deep Deterministic Policy Gradient</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Industrial Applications</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">System identification</Param>
			</Object>
		</ObjectList>
<ArchiveCopySource DocType="pdf">https://ajme.aut.ac.ir/article_5842_fb3ea77a2b3f8e7cb0e4e6699568d43d.pdf</ArchiveCopySource>
</Article>
</ArticleSet>
