• 中文核心期刊要目总览
  • 中国科技核心期刊
  • 中国科学引文数据库(CSCD)
  • 中国科技论文与引文数据库(CSTPCD)
  • 中国学术期刊文摘数据库(CSAD)
  • 中国学术期刊(网络版)(CNKI)
  • 中文科技期刊数据库
  • 万方数据知识服务平台
  • 中国超星期刊域出版平台
  • 国家科技学术期刊开放平台
  • 荷兰文摘与引文数据库(SCOPUS)
  • 日本科学技术振兴机构数据库(JST)

DIFFNMR: DIFFUSION MODELS FOR NUCLEAR MAGNETIC RESONANCE SPECTRA ELUCIDATION

DIFFNMR: DIFFUSION MODELS FOR NUCLEAR MAGNETIC RESONANCE SPECTRA ELUCIDATION

  • 摘要: Nuclear Magnetic Resonance (NMR) spectroscopy is a key method for molecular structure elucidation. However, interpreting NMR spectra to deduce molecular structures remains challenging due to the complexity of spectral data and the vastness of the chemical space. Here we introduce DiffNMR, a novel end-to-end framework that leverages a conditional discrete diffusion model for de novo molecular structure elucidation from NMR spectra. DiffNMR refines molecular graphs iteratively through a diffusion-based generative process, ensuring global consistency and mitigating error accumulation inherent in autoregressive methods. The framework integrates a two-stage pretraining strategy that aligns spectral and molecular representations via a diffusion autoencoder (Diff-AE) and contrastive learning. It also incorporates retrieval initialization and similarity filtering during inference. Our experimental results demonstrate that DiffNMR achieves competitive performance for NMR-based structure elucidation, especially outperforming autoregressive models in domain generalization and robustness, thereby offering an efficient and robust solution for automated molecular analysis.

     

    Abstract: Nuclear Magnetic Resonance (NMR) spectroscopy is a key method for molecular structure elucidation. However, interpreting NMR spectra to deduce molecular structures remains challenging due to the complexity of spectral data and the vastness of the chemical space. Here we introduce DiffNMR, a novel end-to-end framework that leverages a conditional discrete diffusion model for de novo molecular structure elucidation from NMR spectra. DiffNMR refines molecular graphs iteratively through a diffusion-based generative process, ensuring global consistency and mitigating error accumulation inherent in autoregressive methods. The framework integrates a two-stage pretraining strategy that aligns spectral and molecular representations via a diffusion autoencoder (Diff-AE) and contrastive learning. It also incorporates retrieval initialization and similarity filtering during inference. Our experimental results demonstrate that DiffNMR achieves competitive performance for NMR-based structure elucidation, especially outperforming autoregressive models in domain generalization and robustness, thereby offering an efficient and robust solution for automated molecular analysis.

     

/

返回文章
返回