2025-08-20 23:25:42 - INFO - Loading model: LiquidAI/LFM2-VL-1.6B
2025-08-20 23:25:44 - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
2025-08-20 23:25:50 - INFO - Model loaded in 7.25 seconds
2025-08-20 23:25:50 - INFO - GPU Memory Usage after model load: 3023.64 MB
2025-08-20 23:26:55 - INFO - [e83cd3f1-609c-4419-b86e-463266ac54ce] Received new video inference request. Prompt: 'Please describe the video in detail, only focus on customer and staff behavior and activities and do not overly describe the static scene.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_001.mp4'
2025-08-20 23:26:55 - INFO - [e83cd3f1-609c-4419-b86e-463266ac54ce] Video saved to temporary file: temp_videos/e83cd3f1-609c-4419-b86e-463266ac54ce.mp4
2025-08-20 23:26:55 - INFO - [e83cd3f1-609c-4419-b86e-463266ac54ce] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:27:00 - INFO - [e83cd3f1-609c-4419-b86e-463266ac54ce] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:27:00 - INFO - [e83cd3f1-609c-4419-b86e-463266ac54ce] 30 frames saved to temp_videos/e83cd3f1-609c-4419-b86e-463266ac54ce
2025-08-20 23:27:00 - INFO - Prompt token length: 3604
2025-08-20 23:27:20 - INFO - Tokens per second: 43.03910315479847, Peak GPU memory MB: 9378.375
2025-08-20 23:27:20 - INFO - [e83cd3f1-609c-4419-b86e-463266ac54ce] Inference time: 25.01 seconds, CPU usage: 40.1%, CPU core utilization: [36.1, 41.1, 37.2, 46.1]
2025-08-20 23:27:20 - INFO - [e83cd3f1-609c-4419-b86e-463266ac54ce] Cleaned up temporary frame directory: temp_videos/e83cd3f1-609c-4419-b86e-463266ac54ce
2025-08-20 23:27:20 - INFO - [09fa6c2e-50b5-4c0a-ab72-c399a68e3b19] Received new video inference request. Prompt: 'Please describe the video in detail, only focus on customer and staff behavior and activities and do not overly describe the static scene.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_002.mp4'
2025-08-20 23:27:20 - INFO - [09fa6c2e-50b5-4c0a-ab72-c399a68e3b19] Video saved to temporary file: temp_videos/09fa6c2e-50b5-4c0a-ab72-c399a68e3b19.mp4
2025-08-20 23:27:20 - INFO - [09fa6c2e-50b5-4c0a-ab72-c399a68e3b19] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:27:25 - INFO - [09fa6c2e-50b5-4c0a-ab72-c399a68e3b19] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:27:25 - INFO - [09fa6c2e-50b5-4c0a-ab72-c399a68e3b19] 30 frames saved to temp_videos/09fa6c2e-50b5-4c0a-ab72-c399a68e3b19
2025-08-20 23:27:25 - INFO - Prompt token length: 3604
2025-08-20 23:27:47 - INFO - Tokens per second: 42.95401647014546, Peak GPU memory MB: 9378.375
2025-08-20 23:27:47 - INFO - [09fa6c2e-50b5-4c0a-ab72-c399a68e3b19] Inference time: 27.03 seconds, CPU usage: 39.6%, CPU core utilization: [61.4, 39.9, 35.3, 21.8]
2025-08-20 23:27:47 - INFO - [09fa6c2e-50b5-4c0a-ab72-c399a68e3b19] Cleaned up temporary frame directory: temp_videos/09fa6c2e-50b5-4c0a-ab72-c399a68e3b19
2025-08-20 23:27:47 - INFO - [d2c64140-0f5b-4e4a-83b7-feabd7c4323d] Received new video inference request. Prompt: 'Please describe the video in detail, only focus on customer and staff behavior and activities and do not overly describe the static scene.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_003.mp4'
2025-08-20 23:27:47 - INFO - [d2c64140-0f5b-4e4a-83b7-feabd7c4323d] Video saved to temporary file: temp_videos/d2c64140-0f5b-4e4a-83b7-feabd7c4323d.mp4
2025-08-20 23:27:47 - INFO - [d2c64140-0f5b-4e4a-83b7-feabd7c4323d] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:27:52 - INFO - [d2c64140-0f5b-4e4a-83b7-feabd7c4323d] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:27:52 - INFO - [d2c64140-0f5b-4e4a-83b7-feabd7c4323d] 30 frames saved to temp_videos/d2c64140-0f5b-4e4a-83b7-feabd7c4323d
2025-08-20 23:27:52 - INFO - Prompt token length: 3604
2025-08-20 23:28:09 - INFO - Tokens per second: 43.51874005006489, Peak GPU memory MB: 9378.375
2025-08-20 23:28:09 - INFO - [d2c64140-0f5b-4e4a-83b7-feabd7c4323d] Inference time: 22.02 seconds, CPU usage: 39.8%, CPU core utilization: [81.6, 22.0, 31.3, 24.0]
2025-08-20 23:28:09 - INFO - [d2c64140-0f5b-4e4a-83b7-feabd7c4323d] Cleaned up temporary frame directory: temp_videos/d2c64140-0f5b-4e4a-83b7-feabd7c4323d
2025-08-20 23:28:09 - INFO - [d27c71e9-88ae-44b1-834e-64d54e1645f9] Received new video inference request. Prompt: 'Please describe the video in detail, only focus on customer and staff behavior and activities and do not overly describe the static scene.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_004.mp4'
2025-08-20 23:28:09 - INFO - [d27c71e9-88ae-44b1-834e-64d54e1645f9] Video saved to temporary file: temp_videos/d27c71e9-88ae-44b1-834e-64d54e1645f9.mp4
2025-08-20 23:28:09 - INFO - [d27c71e9-88ae-44b1-834e-64d54e1645f9] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:28:14 - INFO - [d27c71e9-88ae-44b1-834e-64d54e1645f9] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:28:14 - INFO - [d27c71e9-88ae-44b1-834e-64d54e1645f9] 30 frames saved to temp_videos/d27c71e9-88ae-44b1-834e-64d54e1645f9
2025-08-20 23:28:14 - INFO - Prompt token length: 3604
2025-08-20 23:28:32 - INFO - Tokens per second: 42.80125306128213, Peak GPU memory MB: 9378.375
2025-08-20 23:28:32 - INFO - [d27c71e9-88ae-44b1-834e-64d54e1645f9] Inference time: 22.93 seconds, CPU usage: 39.8%, CPU core utilization: [19.9, 21.9, 21.1, 96.1]
2025-08-20 23:28:32 - INFO - [d27c71e9-88ae-44b1-834e-64d54e1645f9] Cleaned up temporary frame directory: temp_videos/d27c71e9-88ae-44b1-834e-64d54e1645f9
2025-08-20 23:28:32 - INFO - [16164179-73f7-4aa7-a42b-e6453e0f48af] Received new video inference request. Prompt: 'Please describe the video in detail, only focus on customer and staff behavior and activities and do not overly describe the static scene.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_005.mp4'
2025-08-20 23:28:32 - INFO - [16164179-73f7-4aa7-a42b-e6453e0f48af] Video saved to temporary file: temp_videos/16164179-73f7-4aa7-a42b-e6453e0f48af.mp4
2025-08-20 23:28:32 - INFO - [16164179-73f7-4aa7-a42b-e6453e0f48af] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:28:37 - INFO - [16164179-73f7-4aa7-a42b-e6453e0f48af] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:28:37 - INFO - [16164179-73f7-4aa7-a42b-e6453e0f48af] 30 frames saved to temp_videos/16164179-73f7-4aa7-a42b-e6453e0f48af
2025-08-20 23:28:37 - INFO - Prompt token length: 3604
2025-08-20 23:28:57 - INFO - Tokens per second: 42.835329663650555, Peak GPU memory MB: 9378.375
2025-08-20 23:28:57 - INFO - [16164179-73f7-4aa7-a42b-e6453e0f48af] Inference time: 24.90 seconds, CPU usage: 38.4%, CPU core utilization: [21.2, 37.7, 18.6, 76.1]
2025-08-20 23:28:57 - INFO - [16164179-73f7-4aa7-a42b-e6453e0f48af] Cleaned up temporary frame directory: temp_videos/16164179-73f7-4aa7-a42b-e6453e0f48af
2025-08-20 23:28:57 - INFO - [371792c3-0d46-4934-8417-8fb6f5b7e4c2] Received new video inference request. Prompt: 'Please describe the video in detail, only focus on customer and staff behavior and activities and do not overly describe the static scene.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_006.mp4'
2025-08-20 23:28:57 - INFO - [371792c3-0d46-4934-8417-8fb6f5b7e4c2] Video saved to temporary file: temp_videos/371792c3-0d46-4934-8417-8fb6f5b7e4c2.mp4
2025-08-20 23:28:57 - INFO - [371792c3-0d46-4934-8417-8fb6f5b7e4c2] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:29:02 - INFO - [371792c3-0d46-4934-8417-8fb6f5b7e4c2] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:29:02 - INFO - [371792c3-0d46-4934-8417-8fb6f5b7e4c2] 30 frames saved to temp_videos/371792c3-0d46-4934-8417-8fb6f5b7e4c2
2025-08-20 23:29:02 - INFO - Prompt token length: 3604
2025-08-20 23:29:21 - INFO - Tokens per second: 43.349566710658124, Peak GPU memory MB: 9378.375
2025-08-20 23:29:21 - INFO - [371792c3-0d46-4934-8417-8fb6f5b7e4c2] Inference time: 24.44 seconds, CPU usage: 39.0%, CPU core utilization: [45.2, 19.7, 70.7, 20.1]
2025-08-20 23:29:21 - INFO - [371792c3-0d46-4934-8417-8fb6f5b7e4c2] Cleaned up temporary frame directory: temp_videos/371792c3-0d46-4934-8417-8fb6f5b7e4c2
2025-08-20 23:29:21 - INFO - [fe4d9541-064c-4c8d-bb08-c1d347420c33] Received new video inference request. Prompt: 'Please describe the video in detail, only focus on customer and staff behavior and activities and do not overly describe the static scene.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_007.mp4'
2025-08-20 23:29:21 - INFO - [fe4d9541-064c-4c8d-bb08-c1d347420c33] Video saved to temporary file: temp_videos/fe4d9541-064c-4c8d-bb08-c1d347420c33.mp4
2025-08-20 23:29:21 - INFO - [fe4d9541-064c-4c8d-bb08-c1d347420c33] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:29:26 - INFO - [fe4d9541-064c-4c8d-bb08-c1d347420c33] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:29:26 - INFO - [fe4d9541-064c-4c8d-bb08-c1d347420c33] 30 frames saved to temp_videos/fe4d9541-064c-4c8d-bb08-c1d347420c33
2025-08-20 23:29:27 - INFO - Prompt token length: 3604
2025-08-20 23:29:43 - INFO - Tokens per second: 43.652183023369325, Peak GPU memory MB: 9378.375
2025-08-20 23:29:43 - INFO - [fe4d9541-064c-4c8d-bb08-c1d347420c33] Inference time: 21.99 seconds, CPU usage: 39.8%, CPU core utilization: [80.7, 22.9, 34.6, 21.1]
2025-08-20 23:29:43 - INFO - [fe4d9541-064c-4c8d-bb08-c1d347420c33] Cleaned up temporary frame directory: temp_videos/fe4d9541-064c-4c8d-bb08-c1d347420c33
2025-08-20 23:29:43 - INFO - [e5947bb6-739e-4bc1-bbe4-9ab58dc731d6] Received new video inference request. Prompt: 'Please describe the video in detail, only focus on customer and staff behavior and activities and do not overly describe the static scene.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_008.mp4'
2025-08-20 23:29:43 - INFO - [e5947bb6-739e-4bc1-bbe4-9ab58dc731d6] Video saved to temporary file: temp_videos/e5947bb6-739e-4bc1-bbe4-9ab58dc731d6.mp4
2025-08-20 23:29:43 - INFO - [e5947bb6-739e-4bc1-bbe4-9ab58dc731d6] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:29:48 - INFO - [e5947bb6-739e-4bc1-bbe4-9ab58dc731d6] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:29:48 - INFO - [e5947bb6-739e-4bc1-bbe4-9ab58dc731d6] 30 frames saved to temp_videos/e5947bb6-739e-4bc1-bbe4-9ab58dc731d6
2025-08-20 23:29:49 - INFO - Prompt token length: 3604
2025-08-20 23:30:08 - INFO - Tokens per second: 42.780439620939916, Peak GPU memory MB: 9378.375
2025-08-20 23:30:08 - INFO - [e5947bb6-739e-4bc1-bbe4-9ab58dc731d6] Inference time: 24.21 seconds, CPU usage: 41.1%, CPU core utilization: [62.1, 38.6, 41.8, 21.9]
2025-08-20 23:30:08 - INFO - [e5947bb6-739e-4bc1-bbe4-9ab58dc731d6] Cleaned up temporary frame directory: temp_videos/e5947bb6-739e-4bc1-bbe4-9ab58dc731d6
2025-08-20 23:30:08 - INFO - [b8e6aa49-574c-447b-b3ef-fd02c306e746] Received new video inference request. Prompt: 'Please describe the video in detail, only focus on customer and staff behavior and activities and do not overly describe the static scene.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_009.mp4'
2025-08-20 23:30:08 - INFO - [b8e6aa49-574c-447b-b3ef-fd02c306e746] Video saved to temporary file: temp_videos/b8e6aa49-574c-447b-b3ef-fd02c306e746.mp4
2025-08-20 23:30:08 - INFO - [b8e6aa49-574c-447b-b3ef-fd02c306e746] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:30:13 - INFO - [b8e6aa49-574c-447b-b3ef-fd02c306e746] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:30:13 - INFO - [b8e6aa49-574c-447b-b3ef-fd02c306e746] 30 frames saved to temp_videos/b8e6aa49-574c-447b-b3ef-fd02c306e746
2025-08-20 23:30:13 - INFO - Prompt token length: 3604
2025-08-20 23:30:33 - INFO - Tokens per second: 43.12406437638801, Peak GPU memory MB: 9378.375
2025-08-20 23:30:33 - INFO - [b8e6aa49-574c-447b-b3ef-fd02c306e746] Inference time: 25.30 seconds, CPU usage: 38.6%, CPU core utilization: [18.8, 47.6, 19.3, 68.7]
2025-08-20 23:30:33 - INFO - [b8e6aa49-574c-447b-b3ef-fd02c306e746] Cleaned up temporary frame directory: temp_videos/b8e6aa49-574c-447b-b3ef-fd02c306e746
2025-08-20 23:32:24 - INFO - [f04edbef-149d-425b-8805-113e2ea54029] Received new video inference request. Prompt: 'Summarize the key events in this convenience store video. Focus only on the actions and interactions of the people. Avoid repetitive descriptions of the store's layout or shelves.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_001.mp4'
2025-08-20 23:32:24 - INFO - [f04edbef-149d-425b-8805-113e2ea54029] Video saved to temporary file: temp_videos/f04edbef-149d-425b-8805-113e2ea54029.mp4
2025-08-20 23:32:24 - INFO - [f04edbef-149d-425b-8805-113e2ea54029] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:32:29 - INFO - [f04edbef-149d-425b-8805-113e2ea54029] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:32:29 - INFO - [f04edbef-149d-425b-8805-113e2ea54029] 30 frames saved to temp_videos/f04edbef-149d-425b-8805-113e2ea54029
2025-08-20 23:32:30 - INFO - Prompt token length: 3613
2025-08-20 23:32:46 - INFO - Tokens per second: 43.735840716197266, Peak GPU memory MB: 9378.375
2025-08-20 23:32:46 - INFO - [f04edbef-149d-425b-8805-113e2ea54029] Inference time: 22.10 seconds, CPU usage: 8.4%, CPU core utilization: [9.6, 5.9, 10.7, 7.5]
2025-08-20 23:32:46 - INFO - [f04edbef-149d-425b-8805-113e2ea54029] Cleaned up temporary frame directory: temp_videos/f04edbef-149d-425b-8805-113e2ea54029
2025-08-20 23:32:46 - INFO - [a0f9ff7d-76c8-42a2-ac7f-8064d56ae6f2] Received new video inference request. Prompt: 'Summarize the key events in this convenience store video. Focus only on the actions and interactions of the people. Avoid repetitive descriptions of the store's layout or shelves.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_002.mp4'
2025-08-20 23:32:46 - INFO - [a0f9ff7d-76c8-42a2-ac7f-8064d56ae6f2] Video saved to temporary file: temp_videos/a0f9ff7d-76c8-42a2-ac7f-8064d56ae6f2.mp4
2025-08-20 23:32:46 - INFO - [a0f9ff7d-76c8-42a2-ac7f-8064d56ae6f2] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:32:51 - INFO - [a0f9ff7d-76c8-42a2-ac7f-8064d56ae6f2] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:32:51 - INFO - [a0f9ff7d-76c8-42a2-ac7f-8064d56ae6f2] 30 frames saved to temp_videos/a0f9ff7d-76c8-42a2-ac7f-8064d56ae6f2
2025-08-20 23:32:52 - INFO - Prompt token length: 3613
2025-08-20 23:33:09 - INFO - Tokens per second: 43.61585189482321, Peak GPU memory MB: 9378.375
2025-08-20 23:33:09 - INFO - [a0f9ff7d-76c8-42a2-ac7f-8064d56ae6f2] Inference time: 22.20 seconds, CPU usage: 41.2%, CPU core utilization: [40.5, 22.9, 75.8, 25.4]
2025-08-20 23:33:09 - INFO - [a0f9ff7d-76c8-42a2-ac7f-8064d56ae6f2] Cleaned up temporary frame directory: temp_videos/a0f9ff7d-76c8-42a2-ac7f-8064d56ae6f2
2025-08-20 23:33:09 - INFO - [2f221dc2-a54d-4d32-8900-2b5c046cebaf] Received new video inference request. Prompt: 'Summarize the key events in this convenience store video. Focus only on the actions and interactions of the people. Avoid repetitive descriptions of the store's layout or shelves.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_003.mp4'
2025-08-20 23:33:09 - INFO - [2f221dc2-a54d-4d32-8900-2b5c046cebaf] Video saved to temporary file: temp_videos/2f221dc2-a54d-4d32-8900-2b5c046cebaf.mp4
2025-08-20 23:33:09 - INFO - [2f221dc2-a54d-4d32-8900-2b5c046cebaf] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:33:14 - INFO - [2f221dc2-a54d-4d32-8900-2b5c046cebaf] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:33:14 - INFO - [2f221dc2-a54d-4d32-8900-2b5c046cebaf] 30 frames saved to temp_videos/2f221dc2-a54d-4d32-8900-2b5c046cebaf
2025-08-20 23:33:14 - INFO - Prompt token length: 3613
2025-08-20 23:33:31 - INFO - Tokens per second: 43.7656086803256, Peak GPU memory MB: 9378.375
2025-08-20 23:33:31 - INFO - [2f221dc2-a54d-4d32-8900-2b5c046cebaf] Inference time: 21.94 seconds, CPU usage: 40.5%, CPU core utilization: [37.9, 46.5, 22.3, 55.2]
2025-08-20 23:33:31 - INFO - [2f221dc2-a54d-4d32-8900-2b5c046cebaf] Cleaned up temporary frame directory: temp_videos/2f221dc2-a54d-4d32-8900-2b5c046cebaf
2025-08-20 23:33:31 - INFO - [a0b144b5-c0eb-4c16-8826-7047eed0dbed] Received new video inference request. Prompt: 'Summarize the key events in this convenience store video. Focus only on the actions and interactions of the people. Avoid repetitive descriptions of the store's layout or shelves.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_004.mp4'
2025-08-20 23:33:31 - INFO - [a0b144b5-c0eb-4c16-8826-7047eed0dbed] Video saved to temporary file: temp_videos/a0b144b5-c0eb-4c16-8826-7047eed0dbed.mp4
2025-08-20 23:33:31 - INFO - [a0b144b5-c0eb-4c16-8826-7047eed0dbed] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:33:35 - INFO - [a0b144b5-c0eb-4c16-8826-7047eed0dbed] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:33:35 - INFO - [a0b144b5-c0eb-4c16-8826-7047eed0dbed] 30 frames saved to temp_videos/a0b144b5-c0eb-4c16-8826-7047eed0dbed
2025-08-20 23:33:36 - INFO - Prompt token length: 3613
2025-08-20 23:33:52 - INFO - Tokens per second: 43.67951928934803, Peak GPU memory MB: 9378.375
2025-08-20 23:33:52 - INFO - [a0b144b5-c0eb-4c16-8826-7047eed0dbed] Inference time: 21.80 seconds, CPU usage: 40.2%, CPU core utilization: [67.0, 20.9, 50.2, 22.5]
2025-08-20 23:33:52 - INFO - [a0b144b5-c0eb-4c16-8826-7047eed0dbed] Cleaned up temporary frame directory: temp_videos/a0b144b5-c0eb-4c16-8826-7047eed0dbed
2025-08-20 23:33:52 - INFO - [526b7643-3bfd-4e06-91c3-b91651e42819] Received new video inference request. Prompt: 'Summarize the key events in this convenience store video. Focus only on the actions and interactions of the people. Avoid repetitive descriptions of the store's layout or shelves.', Video: '/mnt/data/xiuying/Code/local_deploy/video/Clips_60s/sample_part_005.mp4'
2025-08-20 23:33:52 - INFO - [526b7643-3bfd-4e06-91c3-b91651e42819] Video saved to temporary file: temp_videos/526b7643-3bfd-4e06-91c3-b91651e42819.mp4
2025-08-20 23:33:52 - INFO - [526b7643-3bfd-4e06-91c3-b91651e42819] Extracting frames using method: uniform, rate/threshold: 30
2025-08-20 23:33:57 - INFO - [526b7643-3bfd-4e06-91c3-b91651e42819] Extracted 30 frames successfully. Saving to temporary files...
2025-08-20 23:33:57 - INFO - [526b7643-3bfd-4e06-91c3-b91651e42819] 30 frames saved to temp_videos/526b7643-3bfd-4e06-91c3-b91651e42819
2025-08-20 23:33:58 - INFO - Prompt token length: 3613
2025-08-20 23:34:15 - INFO - Tokens per second: 43.287042978207154, Peak GPU memory MB: 9378.375
2025-08-20 23:34:15 - INFO - [526b7643-3bfd-4e06-91c3-b91651e42819] Inference time: 22.61 seconds, CPU usage: 40.3%, CPU core utilization: [21.9, 27.2, 21.6, 90.4]
2025-08-20 23:34:15 - INFO - [526b7643-3bfd-4e06-91c3-b91651e42819] Cleaned up temporary frame directory: temp_videos/526b7643-3bfd-4e06-91c3-b91651e42819