LeanQuant committed · Commit e78d713 · verified · 1 Parent(s): 195a01d

Update README.md

Files changed (1): README.md +7 -1
README.md CHANGED
@@ -49,6 +49,7 @@ This is a **DFloat11 losslessly compressed** version of the original `Qwen/Qwen-
 def parse_args():
     parser = argparse.ArgumentParser(description='Generate images using Qwen-Image model')
     parser.add_argument('--cpu_offload', action='store_true', help='Enable CPU offloading')
+    parser.add_argument('--cpu_offload_blocks', type=int, default=None, help='Number of transformer blocks to offload to CPU')
     parser.add_argument('--no_pin_memory', action='store_true', help='Disable memory pinning')
     parser.add_argument('--prompt', type=str, default='A coffee shop entrance features a chalkboard sign reading "Qwen Coffee 😊 $2 per cup," with a neon light beside it displaying "通义千问". Next to it hangs a poster showing a beautiful Chinese woman, and beneath the poster is written "π≈3.1415926-53589793-23846264-33832795-02384197".',
                         help='Text prompt for image generation')
@@ -83,6 +84,7 @@ This is a **DFloat11 losslessly compressed** version of the original `Qwen/Qwen-
     "DFloat11/Qwen-Image-DF11",
     device="cpu",
     cpu_offload=args.cpu_offload,
+    cpu_offload_blocks=args.cpu_offload_blocks,
     pin_memory=not args.no_pin_memory,
     bfloat16_model=transformer,
 )
@@ -136,8 +138,12 @@ This is a **DFloat11 losslessly compressed** version of the original `Qwen/Qwen-
 python qwen_image.py --cpu_offload
 ```
 
-If you are getting out-of-memory errors, try disabling memory-pinning:
+If you are getting out-of-CPU-memory errors, try limiting the number of offloaded blocks or disabling memory-pinning:
 ```bash
+# Offload only 16 blocks (offloading more blocks uses less GPU memory and more CPU memory; offloading less blocks is faster):
+python qwen_image.py --cpu_offload --cpu_offload_blocks 16
+
+# Disable memory-pinning (the most memory efficient way, but could be slower):
 python qwen_image.py --cpu_offload --no_pin_memory
 ```
 
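
A minimal sketch of how the flags touched by this commit map onto the keyword arguments passed in the second hunk. It reproduces only the three offloading-related `argparse` options from the diff. `build_offload_kwargs` is a hypothetical helper introduced here for illustration and is not part of the README or the DFloat11 library. The model-loading call itself is left out, so the sketch runs without a GPU. The diff does not state how the library interprets `cpu_offload_blocks=None`, so the value is simply forwarded unchanged.

```python
import argparse


def parse_args(argv=None):
    """Parse only the offloading-related flags that appear in the diff."""
    parser = argparse.ArgumentParser(description='Generate images using Qwen-Image model')
    parser.add_argument('--cpu_offload', action='store_true', help='Enable CPU offloading')
    parser.add_argument('--cpu_offload_blocks', type=int, default=None,
                        help='Number of transformer blocks to offload to CPU')
    parser.add_argument('--no_pin_memory', action='store_true', help='Disable memory pinning')
    return parser.parse_args(argv)


def build_offload_kwargs(args):
    """Hypothetical helper: gather the keyword arguments the diff forwards to the loader."""
    return {
        "cpu_offload": args.cpu_offload,
        "cpu_offload_blocks": args.cpu_offload_blocks,  # None when the flag is omitted
        "pin_memory": not args.no_pin_memory,           # pinning stays on unless disabled
    }


if __name__ == "__main__":
    # Equivalent to: python qwen_image.py --cpu_offload --cpu_offload_blocks 16
    args = parse_args(["--cpu_offload", "--cpu_offload_blocks", "16"])
    print(build_offload_kwargs(args))
```

Passing `--no_pin_memory` instead of `--cpu_offload_blocks 16` would leave `cpu_offload_blocks` as `None` and set `pin_memory` to `False`, matching the second command in the README hunk.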