Merge updates from upstream and integrate local security report optimizations by hbui290 · Pull Request #163 · NVIDIA/SkillSpector

hbui290 · 2026-06-23T01:33:31Z

This Pull Request merges the upstream changes from NVIDIA/SkillSpector into the codebase and integrates custom enhancements:

Resolved Merge Conflicts: Handled conflicts across core files like llm_analyzer_base.py, report.py, meta_analyzer.py, build_context.py, and test suites.
Custom Scoring Multiplier: Ensured the 1.3x risk scoring multiplier is correctly applied only to findings detected within executable components (rather than scaling the entire report score), combined with the upstream diminishing returns logic.
Optimizations: Addressed edge-case issues in mock initializations during unit tests.

All 725+ tests pass successfully.

… pre-filtering, and bug fixes

… of substring match

…idators

… in optimization test

rng1995

Verdict: Request changes (strong) — a 19-file upstream-merge grab-bag with some genuinely good ideas (SQLite cache, block-aware chunking, retry/backoff), but several blocking problems including a fail-open detection regression.

Blocking

Detection-coverage regression (fail-open). _SKIP_DIRS in src/skillspector/nodes/build_context.py is expanded to include docs, doc, tests, test, spec, specs, build, dist, out, target, images, media, brand (~L775-808), now matched against the relative path (~L884). A malicious skill can hide a payload in e.g. tests/ or docs/ and evade scanning entirely. For a security scanner this is a serious regression — these must not be skipped by default. (The bundled BAO_CAO_TOI_UU_HOA.md even frames this as fixing a "file omission" bug, but the net effect is more directories silently skipped.)
Committed personal/local artifacts. BAO_CAO_TOI_UU_HOA.md and HUONG_DAN_SU_DUNG.md are non-project docs containing dev-machine absolute paths (file:///Users/winston/.gemini/antigravity-ide/scratch/SkillSpector/...). These should not land in the repo.
load_dotenv() at package import in src/skillspector/__init__.py (~L135-136): an import-time side effect that auto-loads .env from CWD. Risky for a tool run against untrusted skill directories — it could ingest a scanned skill's .env. Move it behind the CLI entrypoint, not package import.
Production code detecting test mocks. LLMAnalyzerBase._original_run_batches = LLMAnalyzerBase.run_batches (llm_analyzer_base.py ~L673) plus the is_patched mock-introspection branch in semantic_security_discovery.py (~L730-756) couple prod behavior to the test harness — a code smell that can mask real regressions.
Scope / conflicts. Bundles unrelated changes and overlaps/conflicts with #157 (binary skip), #159/#164/#116 (zip-slip/ingest, _download_file/_extract_zip), and #142/#153/#122 (_compute_risk_score / scoring + report.py). Please split into focused PRs and rebase.

The good ideas (persistent SQLite cache, smarter chunk_file_by_lines, deterministic finding ordering, retry/backoff) are worth landing — just on their own focused PRs and without the _SKIP_DIRS expansion, the committed local docs, and the import-time load_dotenv().

Winston added 8 commits June 22, 2026 10:21

feat: optimize performance with SQLite caching, smart block chunking,…

04ae8ef

… pre-filtering, and bug fixes

security: prevent Zip Slip vulnerability in zip extractor

976dc8a

security: resolve import aliases in AST analyzer to prevent evasion

6614c12

security: fix trusted domain verification by parsing hostname instead…

ce2c8c7

… of substring match

fix: drop ge/le Pydantic bounds and enforce with runtime clamping val…

15610fe

…idators

fix: apply scoring multiplier only to findings in executable files

df4a199

Merge upstream/main into main, resolving conflicts

83f4bcb

Fix executable scripts multiplier propagation and mock get_chat_model…

6d29aa6

… in optimization test

This was referenced Jun 23, 2026

fix(input-handler): validate URLs against SSRF and add zip-slip protection #159

Merged

feat(report): add analysis_completeness field to JSON output #160

Merged

rng1995 requested changes Jun 23, 2026

View reviewed changes

rng1995 mentioned this pull request Jun 23, 2026

fix(input-handler): bound URL, zip, and git ingest paths #164

Open

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge updates from upstream and integrate local security report optimizations#163

Merge updates from upstream and integrate local security report optimizations#163
hbui290 wants to merge 8 commits into
NVIDIA:mainfrom
hbui290:main

hbui290 commented Jun 23, 2026

Uh oh!

rng1995 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

hbui290 commented Jun 23, 2026

Uh oh!

rng1995 left a comment

Choose a reason for hiding this comment

Blocking

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants