Changes from 10 commits
26 commits
d8fc7cc
refactor and support for multi algs fusion
n1ck-guo May 22, 2026
18e8b15
Merge branch 'main' into hengguo/refactor_algs
n1ck-guo May 26, 2026
4de34b8
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] May 26, 2026
bcf3633
fix bugs
n1ck-guo May 26, 2026
92a9723
fix and hanle shared config
n1ck-guo May 26, 2026
bc6569d
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] May 26, 2026
a9fc7df
merge main
n1ck-guo May 26, 2026
f0e8483
Merge branch 'hengguo/refactor_algs' of https://github.com/intel/auto…
n1ck-guo May 26, 2026
5a71dbf
relocate awq
n1ck-guo May 27, 2026
dae7221
Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
n1ck-guo May 27, 2026
533f12b
refactor scheme and entry
n1ck-guo May 28, 2026
6a6f97a
modify by comments
n1ck-guo May 28, 2026
d3a391c
merge main
n1ck-guo May 28, 2026
ecefed9
Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
n1ck-guo May 29, 2026
cab89be
fix ut
n1ck-guo May 29, 2026
8b2af76
fix
n1ck-guo May 29, 2026
6f3abc7
Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
n1ck-guo May 29, 2026
0ea5774
add llmc api
n1ck-guo May 29, 2026
9c7853f
Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
n1ck-guo Jun 3, 2026
4c5c470
fix gguf error
n1ck-guo Jun 3, 2026
8e12461
fix ut
n1ck-guo Jun 4, 2026
c470601
Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
n1ck-guo Jun 4, 2026
8e4fd0e
fix
n1ck-guo Jun 4, 2026
1fdb43a
Merge branch 'main' into hengguo/refactor_algs
n1ck-guo Jun 4, 2026
40a67fa
Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
n1ck-guo Jun 4, 2026
b4ae543
Merge branch 'hengguo/refactor_algs' of https://github.com/intel/auto…
n1ck-guo Jun 4, 2026
7 changes: 2 additions & 5 deletions .claude/skills/adapt-new-diffusion-model/SKILL.md
@@ -52,15 +52,12 @@ def is_diffusion_model(model_or_path):
```

If your model doesn't have `model_index.json`, either create one in the model
-directory or pass diffusion-specific options through new-architecture
-`ExtraConfig` / `AutoRound` kwargs:
+directory or pass diffusion-specific options through `AutoRound` kwargs:

```python
-from auto_round.compressors.config import ExtraConfig
-
ar = AutoRound(
    model,
-    extra_config=ExtraConfig(num_inference_steps=5),
+    num_inference_steps=5,
)
```

2 changes: 1 addition & 1 deletion .claude/skills/add-vlm-model/SKILL.md
@@ -104,7 +104,7 @@ The new architecture routes multimodal calibration through:

If your model works with an existing template/processor, prefer passing
`template=...`, `processor=...`, or `image_processor=...` through `AutoRound` /
-`ExtraConfig` instead of adding compressor code.
+kwargs instead of adding compressor code.

## Step 3: Add Calibration Template

1 change: 1 addition & 0 deletions .gitignore
@@ -12,3 +12,4 @@ tmp_autoround/
ut_log_dir/
CLAUDE.local.md
docs/plan/
+.codegraph/
7 changes: 7 additions & 0 deletions AGENTS.md
@@ -71,3 +71,10 @@ Test fixtures create tiny models (OPT-125M, Qwen-0.6B) at session scope — firs
- `setup.py` forces `CC=CXX=g++` at import time
- Version is computed dynamically from git tags — untagged commits produce dev versions
- Some test dependencies (AutoAWQ, GPTQModel, llama-cpp) require manual git installs — see comments in `test/test_cuda/requirements.txt`
+
+## AI Behavior Guidelines
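+
+1. **Think Before Coding**: Before making a change, restate the goal, locate the relevant code, and state the expected impact; only then write code.
+2. **Simplicity First**: Prefer the simplest solution. If 3 lines will do, don't change 30; reuse existing code rather than creating new code.
+3. **Surgical Changes**: Only touch code related to the current task. If you see unrelated code you dislike, note it down but don't change it.
+4. **Goal-Driven Execution**: After finishing, verify the result in an observable way (run tests / print results / run examples) rather than relying on "I think it's right."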