############################################################
CVE Loading Benchmark Study
############################################################
Date: 2025-11-26T11:41:13.156502
Testing methods: [1, 2]
Dataset sizes: [50, 100, 500]
============================================================
Testing with limit=50 CVEs
============================================================
>>> Testing Method 1: GitHub API (limit=50)
==================================================
CVE Data Loader
==================================================
Database initialized at: benchmarks/test_github_api.db
Get CVE list for year 2024...
Scanning 0xxx...
Found 50 CVE files to process
Processing: 10/50
Processing: 20/50
Processing: 30/50
Processing: 40/50
Processing: 50/50
==================================================
Loading complete!
✓ Loaded: 50 CVEs
✗ Errors: 0
==================================================
============================================================
Method: GitHub API (Individual Files)
============================================================
Duration: 7.93 seconds (0.13 minutes)
CVEs Loaded: 50
Errors: 0
Speed: 6.31 CVEs/second
DB Size: 0.05 MB
Bytes per CVE: 983.04 bytes
>>> Testing Method 2: Git Clone + Local Parse (limit=50)
Cloning CVE repository (this may take a few minutes)...
Found 38781 CVE files for 2024
============================================================
Method: Git Clone (Local Parsing)
============================================================
Duration: 297.90 seconds (4.96 minutes)
CVEs Loaded: 50
Errors: 0
Speed: 0.17 CVEs/second
DB Size: 0.06 MB
Bytes per CVE: 1228.80 bytes
============================================================
Testing with limit=100 CVEs
============================================================
>>> Testing Method 1: GitHub API (limit=100)
==================================================
CVE Data Loader
==================================================
Database initialized at: benchmarks/test_github_api.db
Get CVE list for year 2024...
Scanning 0xxx...
Found 100 CVE files to process
Processing: 10/100
Processing: 20/100
Processing: 30/100
Processing: 40/100
Processing: 50/100
Processing: 60/100
Processing: 70/100
Processing: 80/100
Processing: 90/100
Processing: 100/100
==================================================
Loading complete!
✓ Loaded: 100 CVEs
✗ Errors: 0
==================================================
============================================================
Method: GitHub API (Individual Files)
============================================================
Duration: 13.28 seconds (0.22 minutes)
CVEs Loaded: 100
Errors: 0
Speed: 7.53 CVEs/second
DB Size: 0.07 MB
Bytes per CVE: 696.32 bytes
>>> Testing Method 2: Git Clone + Local Parse (limit=100)
Found 38781 CVE files for 2024
============================================================
Method: Git Clone (Local Parsing)
============================================================
Duration: 0.16 seconds (0.00 minutes)
CVEs Loaded: 100
Errors: 0
Speed: 639.53 CVEs/second
DB Size: 0.09 MB
Bytes per CVE: 983.04 bytes
============================================================
Testing with limit=500 CVEs
============================================================
>>> Testing Method 1: GitHub API (limit=500)
==================================================
CVE Data Loader
==================================================
Database initialized at: benchmarks/test_github_api.db
Get CVE list for year 2024...
Scanning 0xxx...
Found 500 CVE files to process
Processing: 10/500
Processing: 20/500
Processing: 30/500
Processing: 40/500
Processing: 50/500
Processing: 60/500
Processing: 70/500
Processing: 80/500
Processing: 90/500
Processing: 100/500
Processing: 110/500
Processing: 120/500
Processing: 130/500
Processing: 140/500
Processing: 150/500
Processing: 160/500
Processing: 170/500
Processing: 180/500
Processing: 190/500
Processing: 200/500
Processing: 210/500
Processing: 220/500
Processing: 230/500
Processing: 240/500
Processing: 250/500
Processing: 260/500
Processing: 270/500
Processing: 280/500
Processing: 290/500
Processing: 300/500
Processing: 310/500
Processing: 320/500
Processing: 330/500
Processing: 340/500
Processing: 350/500
Processing: 360/500
Processing: 370/500
Processing: 380/500
Processing: 390/500
Processing: 400/500
Processing: 410/500
Processing: 420/500
Processing: 430/500
Processing: 440/500
Processing: 450/500
Processing: 460/500
Processing: 470/500
Processing: 480/500
Processing: 490/500
Processing: 500/500
==================================================
Loading complete!
✓ Loaded: 500 CVEs
✗ Errors: 0
==================================================
============================================================
Method: GitHub API (Individual Files)
============================================================
Duration: 62.51 seconds (1.04 minutes)
CVEs Loaded: 500
Errors: 0
Speed: 8.00 CVEs/second
DB Size: 0.32 MB
Bytes per CVE: 679.94 bytes
>>> Testing Method 2: Git Clone + Local Parse (limit=500)
Found 38781 CVE files for 2024
============================================================
Method: Git Clone (Local Parsing)
============================================================
Duration: 0.18 seconds (0.00 minutes)
CVEs Loaded: 500
Errors: 0
Speed: 2735.44 CVEs/second
DB Size: 0.39 MB
Bytes per CVE: 827.39 bytes
############################################################
SUMMARY COMPARISON
############################################################
Method CVEs Time(s) Speed Size(MB)
------------------------------------------------------------------------------------------
GitHub API (Individual Files) 50 7.93 6.31 0.05
Git Clone (Local Parsing) 50 297.90 0.17 0.06
GitHub API (Individual Files) 100 13.28 7.53 0.07
Git Clone (Local Parsing) 100 0.16 639.53 0.09
GitHub API (Individual Files) 500 62.51 8.00 0.32
Git Clone (Local Parsing) 500 0.18 2735.44 0.39
############################################################
RECOMMENDATIONS
############################################################
✓ Fastest method: Git Clone (Local Parsing)
Speed: 2735.44 CVEs/second
✓ Most space-efficient: GitHub API (Individual Files)
Size per CVE: 679.94 bytes
############################################################
FULL DATASET PROJECTIONS (240,000 CVEs)
############################################################
GitHub API (Individual Files):
Estimated time: 531.27 minutes (8.85 hours)
Estimated size: 159.38 MB (0.16 GB)
Git Clone (Local Parsing):
Estimated time: 6.25 minutes (0.10 hours)
Estimated size: 225.00 MB (0.22 GB)
GitHub API (Individual Files):
Estimated time: 500.04 minutes (8.33 hours)
Estimated size: 155.62 MB (0.15 GB)
Git Clone (Local Parsing):
Estimated time: 1.46 minutes (0.02 hours)
Estimated size: 189.38 MB (0.18 GB)