Resources & Industries ›› 2025, Vol. 27 ›› Issue (5): 123-138.DOI: 10.13776/j.cnki.resourcesindustries.20250901.002

Previous Articles     Next Articles

TEXT ANALYSIS OF CHINA'S HYDROGEN ENERGY POLICY BASED ON LDA THEMATIC MODEL

QIN Yu, HUANG Xiaoli, LIN Lü   

  1. (School of Economics and Management, China University of Petroleum, Beijing 102249, China)
  • Received:2024-11-15 Revised:2025-04-05 Online:2025-10-20 Published:2025-10-20

基于LDA主题模型的中国氢能政策文本分析

秦宇,黄晓莉,林绿   

  1. (中国石油大学(北京) 经济管理学院,北京 102249)
  • 通讯作者: 林绿,博士、副教授,主要从事能源经济管理、环境系统分析研究。E-mail:linlu@cup.edu.cn
  • 作者简介:秦宇,硕士生,主要从事低碳政策文本挖掘研究。E-mail:qinyu@student.cup.edu.cn
  • 基金资助:
    国家重点研发计划项目(2024YFE0213100);国家自然科学基金青年项目(71904203)

Abstract: This paper aims to study the text characteristics and evolution of China's existing hydrogen energy policy, uses 353 pieces of hydrogen energy policies from 2019 to 2023 to establish a four-dimensional framework of “temporal evolution-high frequency words analysis-thematic exploration-thematic evolution”, which is applied to study on 8 comprehensive economic zones. For temporal evolution, annual policy amounts, regional distribution and categories reveal a policy evolution and spatial differences. For high frequency words, Python's Jieba database is used to process policy texts through mapping words cloud and co-occurrence network, identifying policy focuses and regional attention. For thematic exploration, LDA mode combined with TF-IDF algorithm is used to dig the potential themes by mapping thematic words co-occurrence network and analyzing their connection between potential themes and themes. For thematic evolution, mixed proportion of each theme in the policy texts marks the thematic intensity and reveals their thematic evolutionary trend. Policy amounts show an “M-shaped” rising, peaking at 2020 and 2022, due to an issue of key policies. Eastern and northern coasts have the most policy issues, 78 and 76 respectively; planning and method categories are the most, 85 and 68 respectively; prevention category is the least with only one. Policies focus on many aspects including whole industrial chain development, safety supervision. From 2022 to 2023 policies pay more attention to hydrogen energy safety; hydrogen stations are key domain in infrastructural construction; local policies have a single application scenario, and industrial homogeneity is prominent, with fuel cells as the key demonstration application scenario; the frequency of green hydrogen-related words is higher than other high-frequency words. Hydrogen energy policy has five potential themes in enterprise support, safety management, industrial innovation, whole industrial chain development and demonstration; enterprise support is closely related to demonstration, and industrial innovation is closely related to whole industrial chain development; the internal characteristic words of safety management theme are highly aggregated. Policy thematic evolution trend is stable; enterprise support is the leading theme; safety management trend is stable after initial largely rising, lower than enterprise support. Regional policy thematic evolution trend varies a lot with economy, resources and hydrogen energy industrial development. This paper presents a “1+N” policy system regarding China's hydrogen energy industry on selecting localized development direction, increasing green hydrogen policy attention, executing safety supervision on the whole hydrogen industrial chain and enlarging multiple applied scenarios of hydrogen energy.

Key words: hydrogen energy policy, high frequency words, thematic evolution, LDA

摘要: 为了系统研究中国现有氢能产业政策的文本特征与演进规律,以2019—2023年353份中国氢能政策为研究对象,构建“时间演化-高频词分析-主题挖掘-主题演变”政策四维分析框架,结合八大综合经济区展开分析。其中,在时间演化维度,通过统计年度政策数量、区域分布及类型特征,揭示政策发展轨迹与空间差异;在高频词分析维度,基于Python的Jieba库对样本政策措施文本分词处理,绘制词云图与共现网络,识别政策关注焦点及区域注意力差异;在主题挖掘维度,采用LDA模型结合TF-IDF算法挖掘样本政策潜在主题,绘制主题词共现网络,解析政策文本的潜在主题特征与主题间联结关系;在主题演变维度,借助文档集中各主题的混合比例来表征主题强度,探究政策主题演变趋势。研究表明:1)政策数量呈“M型”波动上升,有2020年、2022年两个波峰,关键政策的发布是直接原因;东部与北部沿海是政策发布的高地(分别为78、76份);规划类与办法类占据主导(分别为85、68份),预案类最少(1份)。2)政策对全产业链发展、安全监管等多个方面高度关注;2022—2023年对氢能安全管理的关注有提升;加氢站建设是基础设施中的重点领域;地方政策应用场景单一,产业同质化突出,均将燃料电池作为重点示范应用场景;绿氢相关词汇与其他高频词频次有明显差距。3)氢能政策有企业扶持、安全管理、产业创新、全产业链发展、示范应用5个潜在主题;企业扶持与示范应用,产业创新与全产业链发展联系较为紧密;安全管理主题内部特征词高度聚合。4)政策主题演变趋势整体平稳,企业扶持作为核心议题始终占据主导地位,安全管理在经历初期的大幅提升后趋于稳定,略低于企业扶持;区域间政策主题演变趋势差异显著,各地因经济、资源及氢能产业发展阶段不同,采取了差异化策略。最后就中国氢能产业发展提出加快构建“1+N”政策体系、因地制宜差异化选择发展方向、提升绿氢政策聚焦度、落实氢能全产业链安全防护监测预警及拓展氢能多元化应用场景的建议。

关键词: 氢能政策, 高频词分析, 主题演变, LDA

CLC Number: