LAPSE:2023.5336
Published Article

LAPSE:2023.5336
IP Analytics and Machine Learning Applied to Create Process Visualization Graphs for Chemical Utility Patents
February 23, 2023
Abstract
Researchers must read and understand a large volume of technical papers, including patent documents, to fully grasp the state-of-the-art technological progress in a given domain. Chemical research is particularly challenging with the fast growth of newly registered utility patents (also known as intellectual property or IP) that provide detailed descriptions of the processes used to create a new chemical or a new process to manufacture a known chemical. The researcher must be able to understand the latest patents and literature in order to develop new chemicals and processes that do not infringe on existing claims and processes. This research uses text mining, integrated machine learning, and knowledge visualization techniques to effectively and accurately support the extraction and graphical presentation of chemical processes disclosed in patent documents. The computer framework trains a machine learning model called ALBERT for automatic paragraph text classification. ALBERT separates chemical and non-chemical descriptive paragraphs from a patent for effective chemical term extraction. The ChemDataExtractor is used to classify chemical terms, such as inputs, units, and reactions from the chemical paragraphs. A computer-supported graph-based knowledge representation interface is developed to plot the extracted chemical terms and their chemical process links as a network of nodes with connecting arcs. The computer-supported chemical knowledge visualization approach helps researchers to quickly understand the innovative and unique chemical or processes of any chemical patent of interest.
Researchers must read and understand a large volume of technical papers, including patent documents, to fully grasp the state-of-the-art technological progress in a given domain. Chemical research is particularly challenging with the fast growth of newly registered utility patents (also known as intellectual property or IP) that provide detailed descriptions of the processes used to create a new chemical or a new process to manufacture a known chemical. The researcher must be able to understand the latest patents and literature in order to develop new chemicals and processes that do not infringe on existing claims and processes. This research uses text mining, integrated machine learning, and knowledge visualization techniques to effectively and accurately support the extraction and graphical presentation of chemical processes disclosed in patent documents. The computer framework trains a machine learning model called ALBERT for automatic paragraph text classification. ALBERT separates chemical and non-chemical descriptive paragraphs from a patent for effective chemical term extraction. The ChemDataExtractor is used to classify chemical terms, such as inputs, units, and reactions from the chemical paragraphs. A computer-supported graph-based knowledge representation interface is developed to plot the extracted chemical terms and their chemical process links as a network of nodes with connecting arcs. The computer-supported chemical knowledge visualization approach helps researchers to quickly understand the innovative and unique chemical or processes of any chemical patent of interest.
Record ID
Keywords
bidirectional encoder representations (ALBERT), chemical manufacturing process visualization, chemical utility patents, IP analytics, knowledge graph visualization, Machine Learning, text mining
Subject
Suggested Citation
Trappey AJC, Trappey CV, Liang CP, Lin HJ. IP Analytics and Machine Learning Applied to Create Process Visualization Graphs for Chemical Utility Patents. (2023). LAPSE:2023.5336
Author Affiliations
Trappey AJC: Department of Industrial Engineering and Engineering Management, National Tsing Hua University, Hsinchu 300, Taiwan [ORCID]
Trappey CV: Department of Management Science, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan [ORCID]
Liang CP: Department of Industrial Engineering and Engineering Management, National Tsing Hua University, Hsinchu 300, Taiwan
Lin HJ: Department of Industrial Engineering and Engineering Management, National Tsing Hua University, Hsinchu 300, Taiwan
Trappey CV: Department of Management Science, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan [ORCID]
Liang CP: Department of Industrial Engineering and Engineering Management, National Tsing Hua University, Hsinchu 300, Taiwan
Lin HJ: Department of Industrial Engineering and Engineering Management, National Tsing Hua University, Hsinchu 300, Taiwan
Journal Name
Processes
Volume
9
Issue
8
First Page
1342
Year
2021
Publication Date
2021-07-30
ISSN
2227-9717
Version Comments
Original Submission
Other Meta
PII: pr9081342, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2023.5336
This Record
External Link

https://doi.org/10.3390/pr9081342
Publisher Version
Download
Meta
Record Statistics
Record Views
347
Version History
[v1] (Original Submission)
Feb 23, 2023
Verified by curator on
Feb 23, 2023
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2023.5336
Record Owner
Auto Uploader for LAPSE
Links to Related Works
(0.33 seconds) 0.01 + 0.02 + 0.16 + 0.06 + 0 + 0.03 + 0.01 + 0 + 0.01 + 0.02 + 0 + 0
