arxiv_cs_lg 2026年2月10日

DrugR: 大規模言語モデルによる明示的推論を通じた分子医薬の最適化

DrugR: Optimizing Molecular Drugs through LLM-based Explicit Reasoning

Translated: 2026/3/15 15:03:11

drug-designllmmolecular-optimizationreinforcement-learningadmet

Japanese Translation

arXiv:2602.08213v1 発表タイプ：新規要約：分子生成と最適化は化学分野における基本的かつ重要なタスクです。高度な知識蓄積力と対話機能を持った、特に大規模言語モデル（LLM）を含む知性の道具の急成長により、この分野に新しいパラダイムが提供されています。しかし、LLM に内在する課題は、分子構造と薬理学的性質の複雑な明示的関係の欠如、および対応するラベル付きデータが不足しているという点にあります。このギャップを埋めるため、私たちは DrugR という提案をします。DrugR は、大規模言語モデルベースの手法であり、最適化プロセスに明示的かつ段階的な薬理学的推論を導入します。我々のアプローチは、ドメイン固有の継続的な事前学習、逆データ工学を介した教師あり微調整、そして自己調整されたマルチグラニュームの強化学習を統合します。このフレームワークにより、DrugR は元の分子の核心的な有効性を損なわずながら、重要な ADMET 特性を効果的に改善できます。実験結果は、DrugR が複数の特性にわたる包括的な改善を実現し、構造類似性や結合親和性を損なわないことを示しています。特に、DrugR の明示的推論プロセスは、各最適化段階に対する明確で解釈可能な正当性を提供し、実行可能な設計知見を生成し、自律的かつ知識駆動型の科学的発見へと進展させます。我々のコードとモデルチェックポイントはオープンソース化されました。

Original Content

arXiv:2602.08213v1 Announce Type: new Abstract: Molecule generation and optimization is a fundamental task in chemical domain. The rapid development of intelligent tools, especially large language models (LLMs) with powerful knowledge reserves and interactive capabilities, has provided new paradigms for it. Nevertheless, the intrinsic challenge for LLMs lies in the complex implicit relationship between molecular structure and pharmacological properties and the lack of corresponding labeled data. To bridge this gap, we propose DrugR, an LLM-based method that introduces explicit, step-by-step pharmacological reasoning into the optimization process. Our approach integrates domain-specific continual pretraining, supervised fine-tuning via reverse data engineering, and self-balanced multi-granular reinforcement learning. This framework enables DrugR to effectively improve key ADMET properties while preserving the original molecule's core efficacy. Experimental results demonstrate that DrugR achieves comprehensive enhancement across multiple properties without compromising structural similarity or target binding affinity. Importantly, its explicit reasoning process provides clear, interpretable rationales for each optimization step, yielding actionable design insights and advancing toward automated, knowledge-driven scientific discovery. Our code and model checkpoints are open-sourced to foster future research.