File size: 22,882 Bytes
f8ba0eb |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 |
2025-08-21 01:42:04 - INFO - Loading model: Qwen/Qwen2.5-VL-3B-Instruct-AWQ
2025-08-21 01:42:09 - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
2025-08-21 01:42:40 - INFO - Model loaded in 35.77 seconds
2025-08-21 01:42:40 - INFO - GPU Memory Usage after model load: 3250.55 MB
2025-08-21 02:54:09 - INFO - [c40f2273-a9f5-4d96-82d4-990269ab9708] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_001.mp4'
2025-08-21 02:54:09 - INFO - [c40f2273-a9f5-4d96-82d4-990269ab9708] Video saved to temporary file: temp_videos/c40f2273-a9f5-4d96-82d4-990269ab9708.mp4
2025-08-21 02:54:09 - INFO - [c40f2273-a9f5-4d96-82d4-990269ab9708] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:54:13 - INFO - [c40f2273-a9f5-4d96-82d4-990269ab9708] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:54:13 - INFO - [c40f2273-a9f5-4d96-82d4-990269ab9708] 30 frames saved to temp_videos/c40f2273-a9f5-4d96-82d4-990269ab9708
2025-08-21 02:54:13 - INFO - Prompt token length: 2306
2025-08-21 02:54:23 - INFO - Tokens per second: 11.859020159952623, Peak GPU memory MB: 5350.375
2025-08-21 02:54:23 - INFO - [c40f2273-a9f5-4d96-82d4-990269ab9708] Inference time: 14.12 seconds, CPU usage: 2.0%, CPU core utilization: [2.0, 2.0, 1.9, 1.9]
2025-08-21 02:54:23 - INFO - [c40f2273-a9f5-4d96-82d4-990269ab9708] Cleaned up temporary frame directory: temp_videos/c40f2273-a9f5-4d96-82d4-990269ab9708
2025-08-21 02:54:23 - INFO - [1bbf302e-4b0b-4363-bddd-3fb826552587] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_002.mp4'
2025-08-21 02:54:23 - INFO - [1bbf302e-4b0b-4363-bddd-3fb826552587] Video saved to temporary file: temp_videos/1bbf302e-4b0b-4363-bddd-3fb826552587.mp4
2025-08-21 02:54:23 - INFO - [1bbf302e-4b0b-4363-bddd-3fb826552587] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:54:27 - INFO - [1bbf302e-4b0b-4363-bddd-3fb826552587] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:54:27 - INFO - [1bbf302e-4b0b-4363-bddd-3fb826552587] 30 frames saved to temp_videos/1bbf302e-4b0b-4363-bddd-3fb826552587
2025-08-21 02:54:27 - INFO - Prompt token length: 2306
2025-08-21 02:54:34 - INFO - Tokens per second: 12.033912631174916, Peak GPU memory MB: 5350.375
2025-08-21 02:54:34 - INFO - [1bbf302e-4b0b-4363-bddd-3fb826552587] Inference time: 10.49 seconds, CPU usage: 44.1%, CPU core utilization: [80.0, 27.5, 40.1, 29.0]
2025-08-21 02:54:34 - INFO - [1bbf302e-4b0b-4363-bddd-3fb826552587] Cleaned up temporary frame directory: temp_videos/1bbf302e-4b0b-4363-bddd-3fb826552587
2025-08-21 02:54:34 - INFO - [48b38709-fb9f-4c1d-9db6-279fea58e01f] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_003.mp4'
2025-08-21 02:54:34 - INFO - [48b38709-fb9f-4c1d-9db6-279fea58e01f] Video saved to temporary file: temp_videos/48b38709-fb9f-4c1d-9db6-279fea58e01f.mp4
2025-08-21 02:54:34 - INFO - [48b38709-fb9f-4c1d-9db6-279fea58e01f] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:54:37 - INFO - [48b38709-fb9f-4c1d-9db6-279fea58e01f] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:54:37 - INFO - [48b38709-fb9f-4c1d-9db6-279fea58e01f] 30 frames saved to temp_videos/48b38709-fb9f-4c1d-9db6-279fea58e01f
2025-08-21 02:54:37 - INFO - Prompt token length: 2306
2025-08-21 02:54:45 - INFO - Tokens per second: 11.980873759204092, Peak GPU memory MB: 5350.375
2025-08-21 02:54:45 - INFO - [48b38709-fb9f-4c1d-9db6-279fea58e01f] Inference time: 10.84 seconds, CPU usage: 43.7%, CPU core utilization: [49.1, 32.4, 65.3, 27.8]
2025-08-21 02:54:45 - INFO - [48b38709-fb9f-4c1d-9db6-279fea58e01f] Cleaned up temporary frame directory: temp_videos/48b38709-fb9f-4c1d-9db6-279fea58e01f
2025-08-21 02:54:45 - INFO - [218b6cb4-0c13-4223-b6be-fbc881774b17] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_004.mp4'
2025-08-21 02:54:45 - INFO - [218b6cb4-0c13-4223-b6be-fbc881774b17] Video saved to temporary file: temp_videos/218b6cb4-0c13-4223-b6be-fbc881774b17.mp4
2025-08-21 02:54:45 - INFO - [218b6cb4-0c13-4223-b6be-fbc881774b17] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:54:48 - INFO - [218b6cb4-0c13-4223-b6be-fbc881774b17] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:54:48 - INFO - [218b6cb4-0c13-4223-b6be-fbc881774b17] 30 frames saved to temp_videos/218b6cb4-0c13-4223-b6be-fbc881774b17
2025-08-21 02:54:48 - INFO - Prompt token length: 2306
2025-08-21 02:55:13 - INFO - Tokens per second: 11.894932301505968, Peak GPU memory MB: 5350.375
2025-08-21 02:55:13 - INFO - [218b6cb4-0c13-4223-b6be-fbc881774b17] Inference time: 27.98 seconds, CPU usage: 33.8%, CPU core utilization: [13.9, 45.9, 13.3, 61.9]
2025-08-21 02:55:13 - INFO - [218b6cb4-0c13-4223-b6be-fbc881774b17] Cleaned up temporary frame directory: temp_videos/218b6cb4-0c13-4223-b6be-fbc881774b17
2025-08-21 02:55:13 - INFO - [6550b43c-430e-4dee-8467-1a05b4c082cd] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_005.mp4'
2025-08-21 02:55:13 - INFO - [6550b43c-430e-4dee-8467-1a05b4c082cd] Video saved to temporary file: temp_videos/6550b43c-430e-4dee-8467-1a05b4c082cd.mp4
2025-08-21 02:55:13 - INFO - [6550b43c-430e-4dee-8467-1a05b4c082cd] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:55:16 - INFO - [6550b43c-430e-4dee-8467-1a05b4c082cd] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:55:16 - INFO - [6550b43c-430e-4dee-8467-1a05b4c082cd] 30 frames saved to temp_videos/6550b43c-430e-4dee-8467-1a05b4c082cd
2025-08-21 02:55:16 - INFO - Prompt token length: 2306
2025-08-21 02:55:25 - INFO - Tokens per second: 11.99842860278374, Peak GPU memory MB: 5350.375
2025-08-21 02:55:25 - INFO - [6550b43c-430e-4dee-8467-1a05b4c082cd] Inference time: 12.41 seconds, CPU usage: 40.7%, CPU core utilization: [34.0, 38.6, 64.3, 25.9]
2025-08-21 02:55:25 - INFO - [6550b43c-430e-4dee-8467-1a05b4c082cd] Cleaned up temporary frame directory: temp_videos/6550b43c-430e-4dee-8467-1a05b4c082cd
2025-08-21 02:55:25 - INFO - [172a602d-213b-41d6-b892-e7ca06e535bc] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_006.mp4'
2025-08-21 02:55:25 - INFO - [172a602d-213b-41d6-b892-e7ca06e535bc] Video saved to temporary file: temp_videos/172a602d-213b-41d6-b892-e7ca06e535bc.mp4
2025-08-21 02:55:25 - INFO - [172a602d-213b-41d6-b892-e7ca06e535bc] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:55:28 - INFO - [172a602d-213b-41d6-b892-e7ca06e535bc] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:55:28 - INFO - [172a602d-213b-41d6-b892-e7ca06e535bc] 30 frames saved to temp_videos/172a602d-213b-41d6-b892-e7ca06e535bc
2025-08-21 02:55:29 - INFO - Prompt token length: 2306
2025-08-21 02:55:40 - INFO - Tokens per second: 11.862422969421846, Peak GPU memory MB: 5350.375
2025-08-21 02:55:40 - INFO - [172a602d-213b-41d6-b892-e7ca06e535bc] Inference time: 15.04 seconds, CPU usage: 39.3%, CPU core utilization: [21.5, 43.6, 21.6, 70.6]
2025-08-21 02:55:40 - INFO - [172a602d-213b-41d6-b892-e7ca06e535bc] Cleaned up temporary frame directory: temp_videos/172a602d-213b-41d6-b892-e7ca06e535bc
2025-08-21 02:55:40 - INFO - [082b484d-e219-4cde-ac8e-8af5b8f380cd] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_007.mp4'
2025-08-21 02:55:40 - INFO - [082b484d-e219-4cde-ac8e-8af5b8f380cd] Video saved to temporary file: temp_videos/082b484d-e219-4cde-ac8e-8af5b8f380cd.mp4
2025-08-21 02:55:40 - INFO - [082b484d-e219-4cde-ac8e-8af5b8f380cd] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:55:43 - INFO - [082b484d-e219-4cde-ac8e-8af5b8f380cd] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:55:43 - INFO - [082b484d-e219-4cde-ac8e-8af5b8f380cd] 30 frames saved to temp_videos/082b484d-e219-4cde-ac8e-8af5b8f380cd
2025-08-21 02:55:44 - INFO - Prompt token length: 2306
2025-08-21 02:55:52 - INFO - Tokens per second: 12.007495276914103, Peak GPU memory MB: 5350.375
2025-08-21 02:55:52 - INFO - [082b484d-e219-4cde-ac8e-8af5b8f380cd] Inference time: 11.83 seconds, CPU usage: 42.8%, CPU core utilization: [60.5, 34.4, 49.7, 26.5]
2025-08-21 02:55:52 - INFO - [082b484d-e219-4cde-ac8e-8af5b8f380cd] Cleaned up temporary frame directory: temp_videos/082b484d-e219-4cde-ac8e-8af5b8f380cd
2025-08-21 02:55:52 - INFO - [d4aec199-0b7e-4058-b8ba-bdfbb7806fca] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_008.mp4'
2025-08-21 02:55:52 - INFO - [d4aec199-0b7e-4058-b8ba-bdfbb7806fca] Video saved to temporary file: temp_videos/d4aec199-0b7e-4058-b8ba-bdfbb7806fca.mp4
2025-08-21 02:55:52 - INFO - [d4aec199-0b7e-4058-b8ba-bdfbb7806fca] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:55:55 - INFO - [d4aec199-0b7e-4058-b8ba-bdfbb7806fca] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:55:55 - INFO - [d4aec199-0b7e-4058-b8ba-bdfbb7806fca] 30 frames saved to temp_videos/d4aec199-0b7e-4058-b8ba-bdfbb7806fca
2025-08-21 02:55:56 - INFO - Prompt token length: 2306
2025-08-21 02:56:04 - INFO - Tokens per second: 11.871294681994929, Peak GPU memory MB: 5350.375
2025-08-21 02:56:04 - INFO - [d4aec199-0b7e-4058-b8ba-bdfbb7806fca] Inference time: 12.13 seconds, CPU usage: 43.3%, CPU core utilization: [35.9, 32.5, 78.1, 26.8]
2025-08-21 02:56:04 - INFO - [d4aec199-0b7e-4058-b8ba-bdfbb7806fca] Cleaned up temporary frame directory: temp_videos/d4aec199-0b7e-4058-b8ba-bdfbb7806fca
2025-08-21 02:56:04 - INFO - [20eacc2f-2a33-4211-b488-f449c4bbc64d] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_009.mp4'
2025-08-21 02:56:04 - INFO - [20eacc2f-2a33-4211-b488-f449c4bbc64d] Video saved to temporary file: temp_videos/20eacc2f-2a33-4211-b488-f449c4bbc64d.mp4
2025-08-21 02:56:04 - INFO - [20eacc2f-2a33-4211-b488-f449c4bbc64d] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:56:07 - INFO - [20eacc2f-2a33-4211-b488-f449c4bbc64d] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:56:07 - INFO - [20eacc2f-2a33-4211-b488-f449c4bbc64d] 30 frames saved to temp_videos/20eacc2f-2a33-4211-b488-f449c4bbc64d
2025-08-21 02:56:08 - INFO - Prompt token length: 2306
2025-08-21 02:56:15 - INFO - Tokens per second: 11.63501242448262, Peak GPU memory MB: 5350.375
2025-08-21 02:56:15 - INFO - [20eacc2f-2a33-4211-b488-f449c4bbc64d] Inference time: 10.73 seconds, CPU usage: 46.3%, CPU core utilization: [38.3, 58.2, 31.7, 56.7]
2025-08-21 02:56:15 - INFO - [20eacc2f-2a33-4211-b488-f449c4bbc64d] Cleaned up temporary frame directory: temp_videos/20eacc2f-2a33-4211-b488-f449c4bbc64d
2025-08-21 02:56:15 - INFO - [7bd61912-f2d8-49f3-a1d2-d25a5bb09ff5] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_010.mp4'
2025-08-21 02:56:15 - INFO - [7bd61912-f2d8-49f3-a1d2-d25a5bb09ff5] Video saved to temporary file: temp_videos/7bd61912-f2d8-49f3-a1d2-d25a5bb09ff5.mp4
2025-08-21 02:56:15 - INFO - [7bd61912-f2d8-49f3-a1d2-d25a5bb09ff5] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:56:18 - INFO - [7bd61912-f2d8-49f3-a1d2-d25a5bb09ff5] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:56:18 - INFO - [7bd61912-f2d8-49f3-a1d2-d25a5bb09ff5] 30 frames saved to temp_videos/7bd61912-f2d8-49f3-a1d2-d25a5bb09ff5
2025-08-21 02:56:18 - INFO - Prompt token length: 2306
2025-08-21 02:56:31 - INFO - Tokens per second: 11.874488678953208, Peak GPU memory MB: 5350.375
2025-08-21 02:56:31 - INFO - [7bd61912-f2d8-49f3-a1d2-d25a5bb09ff5] Inference time: 16.08 seconds, CPU usage: 37.8%, CPU core utilization: [19.6, 68.7, 18.4, 44.3]
2025-08-21 02:56:31 - INFO - [7bd61912-f2d8-49f3-a1d2-d25a5bb09ff5] Cleaned up temporary frame directory: temp_videos/7bd61912-f2d8-49f3-a1d2-d25a5bb09ff5
2025-08-21 02:56:31 - INFO - [305ccf60-14df-466d-8565-f04265430ba1] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_011.mp4'
2025-08-21 02:56:31 - INFO - [305ccf60-14df-466d-8565-f04265430ba1] Video saved to temporary file: temp_videos/305ccf60-14df-466d-8565-f04265430ba1.mp4
2025-08-21 02:56:31 - INFO - [305ccf60-14df-466d-8565-f04265430ba1] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:56:34 - INFO - [305ccf60-14df-466d-8565-f04265430ba1] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:56:34 - INFO - [305ccf60-14df-466d-8565-f04265430ba1] 30 frames saved to temp_videos/305ccf60-14df-466d-8565-f04265430ba1
2025-08-21 02:56:35 - INFO - Prompt token length: 2306
2025-08-21 02:56:44 - INFO - Tokens per second: 11.829041430743297, Peak GPU memory MB: 5350.375
2025-08-21 02:56:44 - INFO - [305ccf60-14df-466d-8565-f04265430ba1] Inference time: 12.93 seconds, CPU usage: 42.0%, CPU core utilization: [28.8, 42.5, 25.9, 70.9]
2025-08-21 02:56:44 - INFO - [305ccf60-14df-466d-8565-f04265430ba1] Cleaned up temporary frame directory: temp_videos/305ccf60-14df-466d-8565-f04265430ba1
2025-08-21 02:56:44 - INFO - [659dc8e0-c40a-432f-887e-c9cdeefc17a4] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_012.mp4'
2025-08-21 02:56:44 - INFO - [659dc8e0-c40a-432f-887e-c9cdeefc17a4] Video saved to temporary file: temp_videos/659dc8e0-c40a-432f-887e-c9cdeefc17a4.mp4
2025-08-21 02:56:44 - INFO - [659dc8e0-c40a-432f-887e-c9cdeefc17a4] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:56:47 - INFO - [659dc8e0-c40a-432f-887e-c9cdeefc17a4] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:56:47 - INFO - [659dc8e0-c40a-432f-887e-c9cdeefc17a4] 30 frames saved to temp_videos/659dc8e0-c40a-432f-887e-c9cdeefc17a4
2025-08-21 02:56:48 - INFO - Prompt token length: 2306
2025-08-21 02:56:58 - INFO - Tokens per second: 11.928726359703456, Peak GPU memory MB: 5350.375
2025-08-21 02:56:58 - INFO - [659dc8e0-c40a-432f-887e-c9cdeefc17a4] Inference time: 13.75 seconds, CPU usage: 39.8%, CPU core utilization: [31.6, 62.3, 41.3, 23.8]
2025-08-21 02:56:58 - INFO - [659dc8e0-c40a-432f-887e-c9cdeefc17a4] Cleaned up temporary frame directory: temp_videos/659dc8e0-c40a-432f-887e-c9cdeefc17a4
2025-08-21 02:56:58 - INFO - [05a4c1b9-d6d6-4e4e-a0f2-f58a0664c989] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_013.mp4'
2025-08-21 02:56:58 - INFO - [05a4c1b9-d6d6-4e4e-a0f2-f58a0664c989] Video saved to temporary file: temp_videos/05a4c1b9-d6d6-4e4e-a0f2-f58a0664c989.mp4
2025-08-21 02:56:58 - INFO - [05a4c1b9-d6d6-4e4e-a0f2-f58a0664c989] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:57:01 - INFO - [05a4c1b9-d6d6-4e4e-a0f2-f58a0664c989] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:57:01 - INFO - [05a4c1b9-d6d6-4e4e-a0f2-f58a0664c989] 30 frames saved to temp_videos/05a4c1b9-d6d6-4e4e-a0f2-f58a0664c989
2025-08-21 02:57:01 - INFO - Prompt token length: 2306
2025-08-21 02:57:07 - INFO - Tokens per second: 12.014726651428436, Peak GPU memory MB: 5350.375
2025-08-21 02:57:07 - INFO - [05a4c1b9-d6d6-4e4e-a0f2-f58a0664c989] Inference time: 9.37 seconds, CPU usage: 43.8%, CPU core utilization: [29.4, 29.6, 88.1, 27.9]
2025-08-21 02:57:07 - INFO - [05a4c1b9-d6d6-4e4e-a0f2-f58a0664c989] Cleaned up temporary frame directory: temp_videos/05a4c1b9-d6d6-4e4e-a0f2-f58a0664c989
2025-08-21 02:57:07 - INFO - [0f5076d3-96af-4d28-be73-0db23c76eaf4] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_014.mp4'
2025-08-21 02:57:07 - INFO - [0f5076d3-96af-4d28-be73-0db23c76eaf4] Video saved to temporary file: temp_videos/0f5076d3-96af-4d28-be73-0db23c76eaf4.mp4
2025-08-21 02:57:07 - INFO - [0f5076d3-96af-4d28-be73-0db23c76eaf4] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:57:10 - INFO - [0f5076d3-96af-4d28-be73-0db23c76eaf4] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:57:10 - INFO - [0f5076d3-96af-4d28-be73-0db23c76eaf4] 30 frames saved to temp_videos/0f5076d3-96af-4d28-be73-0db23c76eaf4
2025-08-21 02:57:11 - INFO - Prompt token length: 2306
2025-08-21 02:57:19 - INFO - Tokens per second: 11.861972979079045, Peak GPU memory MB: 5350.375
2025-08-21 02:57:19 - INFO - [0f5076d3-96af-4d28-be73-0db23c76eaf4] Inference time: 11.61 seconds, CPU usage: 41.6%, CPU core utilization: [42.9, 26.4, 27.0, 69.9]
2025-08-21 02:57:19 - INFO - [0f5076d3-96af-4d28-be73-0db23c76eaf4] Cleaned up temporary frame directory: temp_videos/0f5076d3-96af-4d28-be73-0db23c76eaf4
2025-08-21 02:57:19 - INFO - [a60e4adc-5a10-496e-8dba-e95fa8204801] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_015.mp4'
2025-08-21 02:57:19 - INFO - [a60e4adc-5a10-496e-8dba-e95fa8204801] Video saved to temporary file: temp_videos/a60e4adc-5a10-496e-8dba-e95fa8204801.mp4
2025-08-21 02:57:19 - INFO - [a60e4adc-5a10-496e-8dba-e95fa8204801] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:57:22 - INFO - [a60e4adc-5a10-496e-8dba-e95fa8204801] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:57:22 - INFO - [a60e4adc-5a10-496e-8dba-e95fa8204801] 30 frames saved to temp_videos/a60e4adc-5a10-496e-8dba-e95fa8204801
2025-08-21 02:57:22 - INFO - Prompt token length: 2306
2025-08-21 02:57:31 - INFO - Tokens per second: 12.034885208983422, Peak GPU memory MB: 5350.375
2025-08-21 02:57:31 - INFO - [a60e4adc-5a10-496e-8dba-e95fa8204801] Inference time: 12.68 seconds, CPU usage: 39.5%, CPU core utilization: [59.8, 22.7, 53.7, 22.1]
2025-08-21 02:57:31 - INFO - [a60e4adc-5a10-496e-8dba-e95fa8204801] Cleaned up temporary frame directory: temp_videos/a60e4adc-5a10-496e-8dba-e95fa8204801
2025-08-21 02:57:31 - INFO - [262c15ae-e353-4d00-b508-c4d77d75300a] Received new video inference request. Prompt: 'Summarize the key observable events in this 1-minute convenience store video clip. Focus strictly on the physical actions and interactions of the people. Describe only what you can see', Video: '/mnt/data/xiuying/Code/local_deploy/video/new/Clips_60s/video_part_016.mp4'
2025-08-21 02:57:31 - INFO - [262c15ae-e353-4d00-b508-c4d77d75300a] Video saved to temporary file: temp_videos/262c15ae-e353-4d00-b508-c4d77d75300a.mp4
2025-08-21 02:57:31 - INFO - [262c15ae-e353-4d00-b508-c4d77d75300a] Extracting frames using method: uniform, rate/threshold: 30
2025-08-21 02:57:35 - INFO - [262c15ae-e353-4d00-b508-c4d77d75300a] Extracted 30 frames successfully. Saving to temporary files...
2025-08-21 02:57:35 - INFO - [262c15ae-e353-4d00-b508-c4d77d75300a] 30 frames saved to temp_videos/262c15ae-e353-4d00-b508-c4d77d75300a
2025-08-21 02:57:35 - INFO - Prompt token length: 2306
2025-08-21 02:57:59 - INFO - Tokens per second: 12.052444962168167, Peak GPU memory MB: 5350.375
2025-08-21 02:57:59 - INFO - [262c15ae-e353-4d00-b508-c4d77d75300a] Inference time: 27.71 seconds, CPU usage: 33.2%, CPU core utilization: [31.4, 17.1, 70.7, 13.4]
2025-08-21 02:57:59 - INFO - [262c15ae-e353-4d00-b508-c4d77d75300a] Cleaned up temporary frame directory: temp_videos/262c15ae-e353-4d00-b508-c4d77d75300a
|