The final note should not read like raw extracted evidence.
Use the structured artifacts as inputs:
metadata.jsonevidence_pack.jsonfigure_plan.jsonsynthesis_bundle.json
Then let the model draft the final note in natural language.
Near the beginning of the note, include:
## 核心信息## 原文摘要翻译## 创新点## 一句话总结
## 核心信息 is a fixed metadata block, not an analysis block.
Rules for this section:
- only use the predefined metadata-style fields from the note template
- keep each line in
- 字段名: 值form - do not add ad hoc fields such as judgments, takeaways, or mini-summaries
- do not move explanatory prose, evaluation, or "my view" sentences into this section
- if a field is unavailable, leave it blank or mark it as not clearly available rather than replacing the field with commentary
The 原文摘要翻译 section should be a Chinese translation of the paper's original abstract:
- if the abstract is available, translate the original abstract into Chinese before the one-sentence summary
- do not let the summary replace the abstract
- do not treat
原文摘要翻译as your own summary of the full paper; it is the original abstract translated into Chinese - do not split this section into
### 英文原文and### 中文翻译 - keep the section title exactly as
原文摘要翻译 - the
原文摘要翻译section itself must be written in Chinese; do not output English abstract sentences or English-original paragraphs here - the Chinese abstract should be fluent and faithful, not a second
一句话总结 - do not turn
原文摘要翻译into a selective excerpt or a compressed highlight list - do not add judgments, hindsight, or details learned from later sections of the paper into
原文摘要翻译; only translate what the original abstract says
The 创新点 section should be a dedicated top-level section after 原文摘要翻译 rather than a hidden bullet buried later.
It should usually:
- enumerate 3 to 5 paper-specific innovations
- explain what problem each innovation addresses
- explain what new capability, mechanism, or evaluation angle it enables
- avoid generic praise such as
the paper is novelwithout locating the novelty
Default to a high-bar technical reader and writer persona:
- you are a top-tier AI researcher and algorithm engineer
- you are preparing an internal replication-oriented reading note for your lab
- you are not writing a science-pop summary
- you should assume the reader is comfortable with Python, PyTorch, training loops, evaluation protocols, and ablation logic
For technical or method papers, write as if the note may later be used for:
- implementation planning
- reproduction
- comparison against later papers
- deciding whether the method is actually novel or just well-packaged
- explain the paper rather than quote it
- distinguish research problem from task definition
- explain the method or analysis flow in your own words
- choose the most meaningful results rather than repeating every number
- say what the paper does not prove
- keep the note readable weeks later
- make the technical core understandable enough for an engineer to re-explain it
Scripts are good at:
- resolution
- extraction
- formatting
- linting
- placeholder planning
Scripts are not enough on their own for:
- nuanced judgment
- identifying what is easy to misread
- deciding what the paper's real contribution is
- writing strong, natural Chinese analytical prose
The language model should do all of the following:
- infer the paper type from the evidence bundle
- make an explicit short
note_planbefore drafting - decide which sections need more weight
- decide where
###subheadings are needed - select the truly central results
- reconstruct the method or analysis flow
- decide whether the paper needs explicit LaTeX formulas for the core objective, factorization, or complexity
- write the final note in clean Chinese
The note should feel like:
- a careful reading note
- not an abstract rewrite
- not a raw evidence dump
- not a benchmark table converted into bullets
The final Chinese note must also pass a language-cleanliness check:
- no half-English half-Chinese prose lines
- English is allowed only for stable proper nouns or citation metadata
- if the style gate fails, do not write the note into Obsidian yet
- do not write for the linter; lint is only a minimum floor, not the writing objective
- after script lint passes,
final_readability_reviewis still required before the note should be treated as polished and ready to save
正文术语策略:
- default to natural Chinese prose in正文分析
- keep English only when it is a stable proper noun or source-faithful technical label
- stable English that may remain:
- model names
- dataset names
- metric names
- method names
- math symbols
- code tokens
- original paper figure/table ids
- English that should usually be rewritten into natural Chinese:
- ordinary English phrases
- abstract descriptive phrases in analytical prose
- leftover English wording that has no clear reason to remain
- when a first mention benefits from both forms, prefer Chinese-first wording with an English gloss in parentheses
- do not leave phrases such as
reasoning dataset,distillation risk, orreward model qualitydirectly inside Chinese prose when a natural Chinese rendering is available
For non-trivial papers, the note should usually not stop at only broad ## sections.
It should use meaningful ### subheadings where they improve technical clarity.
Before the final draft exists, there should already be a compact structured planning artifact such as <note_plan>...</note_plan> or an equivalent temporary planning file.
This plan should be short and inspectable.
Do not require or expose a long free-form <thinking> block.
Examples:
### 数据来源### 任务定义### 中间特征抽取### 训练细节### 哪些结果最重要### 哪些地方容易被误读
For technical papers, also strongly consider subsections such as:
### 机制流程### 训练目标### 推理与采样链路### 关键实现细节### 复杂度与扩展性### 消融到底说明了什么
For method, framework, and system papers, prefer an explicit ### 机制流程 subsection instead of hiding the execution chain inside generic prose.
That subsection should usually be a 3 to 4 step numbered list covering:
- what the Input is
- what the main intermediate transformations are
- what the Output is
- what the training or inference loop is actually doing
- do not rely on a damaged Algorithm block to carry this explanation for you
- do not let the steps collapse into module-name listing; each step should describe an operation
- if a high-confidence pipeline or architecture figure matches this execution chain, place it in
### 机制流程
Do not avoid formulas by default. When the paper's method or claim depends on:
- a training objective
- a probability factorization
- a complexity expression
- a scaling-law fit
- a key update rule or optimization target
the note should usually include 1 to 3 essential LaTeX formulas in the relevant section.
Use formulas sparingly and purposefully:
- each formula should help explain the method
- do not dump many formulas just to look technical
- if the source extraction is noisy, prefer reconstructing a small, stable core formula rather than copying broken math verbatim
- after each retained formula, add one sentence explaining what it corresponds to in engineering or code terms
- do not only translate variable names; explain the concrete operation, loss term, update rule, or control effect
- formulas in the final Markdown should be written as directly renderable Obsidian/MathJax math, not as JSON-style escaped strings
- do not double-escape TeX commands such as
\\tau,\\frac,\\bar,\\begin, or\\endwhen the final note should contain\tau,\frac,\bar,\begin, or\end - use real math delimiters:
- inline math:
$...$ - display math:
$$ ... $$
- inline math:
- do not format formulas as inline code with backticks
- do not put formulas inside fenced code blocks unless you are literally discussing source code or pseudocode
Chinese paragraphs should read like natural prose, not like PDF fragments.
Do not leave:
- mid-sentence line breaks after commas or semicolons
- one sentence broken into many short physical lines
- raw PDF folding artifacts inside normal paragraphs
Allowed line breaks:
- between paragraphs
- bullet lists
- block quotes
- figure callouts
- fenced code or formula blocks
Start from placeholders, not from extracted images. The note should preserve the full figure/table structure even when image extraction is partial.
If the bundle contains candidate figure pages or candidate image files:
- use them as evidence for semantic matching
- prefer the candidate with the strongest caption/page-context agreement
- still make the final decision yourself rather than trusting the candidate ranking blindly
Final-note figure rules:
- keep the original paper numbering, such as
Fig. 1,Fig. 3,Table 2 - do not rename them to
图 1,图 2just because of note order - if you replace a placeholder with a real image, keep the same paper figure id in the caption
- if an important figure cannot be confidently extracted, keep a placeholder with a short explanation
- text may be complete even when figures are partial; do not let missing images erase textual coverage
- complete the figure decision inside the same task as the note generation
- do not stop after the text draft and ask the user whether to continue with figures unless they explicitly asked for a staged workflow
- prefer a stable figure callout format in the final note:
> [!figure] Fig. 3 ...> 建议位置:...> 放置原因:...> 当前状态:...
Before outputting the final Markdown, explicitly check:
- does the note contain concrete numbers, dimensions, complexity terms, or formulas when the paper clearly depends on them?
- can a reader familiar with Python and deep learning frameworks follow the core method from this note alone?
- does the method section explain the mechanism rather than only summarize the claim?
- if this is a method/system/framework paper, does
方法主线explicitly contain### 机制流程with a 3 to 4 step numbered list? - if the evidence bundle contains negative or unstable ablation signals, did the note include at least one of them?
- if the evidence bundle does not contain such signals, did the note explicitly say the paper did not clearly report failed or unstable settings?
- does the note contain at least one honest limitation and one paper-specific insight?
- are there any suspicious mid-sentence line breaks left in the prose?
- after script lint passes, have you reread the full note once more for readability rather than stopping at
lint passed? - are there still any stiff translations, awkward Chinese phrasing, or ordinary English phrases that should be rewritten into natural Chinese?
- are there any lines that sound like they were written only to satisfy lint or section compliance rather than to help a real reader?
- if the note includes LaTeX formulas, did you quickly check that the final Markdown uses directly renderable TeX rather than double-escaped commands or broken math delimiters?
This final readability review is a language-and-expression pass, not a second evidence-judgment pass:
- improve fluency and readability
- remove stiff translations
- convert ordinary English phrase leftovers into natural Chinese
- keep stable proper nouns when forcing a translation would sound worse
- do not invent new facts, numbers, comparisons, or failure cases during this pass
- do not use polish as an excuse to flatten the note into a safer but shallower summary
If the answer to the first three questions is no, the draft is still too shallow and should be revised before save.