Refine AutoScheme logic for gguf, reduce memory consumption and improve speed #1916

Open: wenhuach21 wants to merge 17 commits into main from refine_dl

@wenhuach21 commented Jun 11, 2026

Contributor

Description

Please briefly describe your main changes, the motivation.

Type of Change

Bug fix

Related Issues

Fixes or relates to #

Checklist Before Submitting

  • My code has been tested locally.
  • Documentation has been updated as needed.
  • New or updated tests are included where applicable.
  • The CUDA CI has passed. You can trigger it by commenting /azp run Unit-Test-CUDA-AutoRound.
  • Near-duplicate options should be removed, such as gguf:q4_k_s, gguf:q4_k_m, etc.

@chensuyue

Contributor

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

@wenhuach21 changed the title from "adjust auto scheme logic" to "Refine AutoScheme logic to reduce memory consumption and improve speed" on Jun 12, 2026
@wenhuach21 changed the title from "Refine AutoScheme logic to reduce memory consumption and improve speed" to "Refine AutoScheme logic, reduce memory consumption and improve speed" on Jun 12, 2026
@chensuyue

Contributor

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

@chensuyue

Contributor

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

@wenhuach21 changed the title from "Refine AutoScheme logic, reduce memory consumption and improve speed" to "Refine AutoScheme logic for gguf, reduce memory consumption and improve speed" on Jun 12, 2026
@chensuyue

Contributor

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

3 participants