Skip to content

Feat/test azure filestore metadata#1234

Merged
jschoedl merged 119 commits into
datacontract:mainfrom
dmaresma:feat/test_filestore_metadata
Jun 18, 2026
Merged

Feat/test azure filestore metadata#1234
jschoedl merged 119 commits into
datacontract:mainfrom
dmaresma:feat/test_filestore_metadata

Conversation

@dmaresma

Copy link
Copy Markdown
Contributor

related to #1227

  • [ x] Tests pass (uv run pytest)
  • [x ] Code formatted (uv run ruff check --fix && uv run ruff format)
  • README.md updated (if relevant)
  • [ x] CHANGELOG.md entry added

dmaresma and others added 30 commits January 24, 2026 19:32
…attributes add sort by schema.name and properties by ordinal_position
jschoedl added a commit to dmaresma/datacontract-cli that referenced this pull request Jun 10, 2026
Comment thread datacontract/engines/datacontract/check_azure_blob_file.py Dismissed
Comment thread datacontract/engines/datacontract/check_azure_blob_file.py Dismissed
jschoedl added a commit that referenced this pull request Jun 13, 2026
* init snowflake importer

* ruff format fix

* add documentation

* improve export query and fix ordered columns position, disable empty attributes add sort by schema.name and properties by ordinal_position

* inject account in server.account

* improve connector login

* ruff

* rename customProperties ordinalPosition

* refactor snowflake-db and add test_import_snowflake

* ruff lint applied

* typo during refactor

* force string, char as varchar for physicalType

* add columns tags, customProperties: precisition, scale, autoIncrement, defaultValue, Roles in server

* refactor

* turn snowflake tags as str Key=Value array

* added tests and key authentication

* add DMF as quality (schema & properties) based on last results

* merged upstream changes

* removed temp file

* fix bug

* typo on description

* test success

* double-quoted schema identifiers and authenticator documentation

* ruff format

* detect snowflake_authenticator in env. variable

* ruff format

* remove_unexpect_characters

* remove schema

* fix test failure

* implement server mandatory properties

* remove typo quote

* refactor use of the odcs_helper (improved)

Co-authored-by: Copilot <copilot@github.com>

* remove un used package

* linting format and fixes

Co-authored-by: Copilot <copilot@github.com>

* add property sort by ordinal position and schema sort by name

Co-authored-by: Copilot <copilot@github.com>

* run ruff format fix

* test fix

* improve physicalType extraction

* physicalType ruff format 'n check

* autoincrement wrong physicaltype format, Tags issues

* fix reinject unique property when IDENTITY found

* physicalType drive the logicalType using sql import mapping

* use custom --schema option to align with v0.12.0

* use snake_case for variable names

* typos & ruff format

* Snowflake environment variable table updated

* refactor based on comments

* refactor

* ruff applied

* test fix

* refactor with github copilot

* init

* add change log

* change pbix into pbit

* init

* datacontract binding schema properties on the azure blob metadata

* run ruff format and ruff check --fix

* move references from customproperties to relationship object, add id and expression into transformationLogic

* improve add relationship

* format fix and comment changes applied

* fix test sequence

* pass the test

* refactor

* format fix

* lib dependency fix

* remove docstring

* rfc pending

* init

* slightly modify fixture & README

* add template schema's properties customProperties start

* changelog update

* Update pyproject.toml

Co-authored-by: Jakob Schödl <jakob.schoedl@mailbox.org>

* Update pyproject.toml

Co-authored-by: Jakob Schödl <jakob.schoedl@mailbox.org>

* refactor, add schema logicalType (blob as experiment)

* init

* error code typo

* refactor format for powerbi-semantic-model

* format and ruff check

* undo changes that are already in #1234 and #1268

* refactor: polish Power BI importer naming, types, and tests

Rename ImportFormat.powerbi → powerbi_semantic_model and PowerBIImporter
→ PowerBiImporter for naming consistency, modernize type hints to builtin
generics, fix invalid from_table.name in relationship wiring, and
tighten the test suite (add list-form DAX coverage, drop redundant tests).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* improve readme powerbi instruction

* improve powerbi pbit (.zip) import test

* normalize schema name everywhere

* update changelog and readme

---------

Co-authored-by: Bodo Huesemann <bhuesemann@users.noreply.github.com>
Co-authored-by: Jochen Christ <jochen.christ@entropy-data.com>
Co-authored-by: Copilot <copilot@github.com>
Co-authored-by: Jakob Schödl <jakob.schoedl@entropy-data.com>
Co-authored-by: Jakob Schödl <jakob.schoedl@mailbox.org>
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@dmaresma

Copy link
Copy Markdown
Contributor Author

Wow @jschoedl thanks a lot.

jschoedl and others added 2 commits June 17, 2026 11:27
…ection errors

- Filter blob schemas by schema_name so --schema-name limits what runs (datacontract#1234)
- Thread check_categories through so --checks schema/quality filters per-check
- Classify connection vs auth failures in list_blobs with actionable messages
- Add end-to-end DataContract.test() test guarding the wiring against merge loss
- Drop redundant _parse_location https unit test

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
…issing_env failures

Blob schemas (logicalType: blob) are owned by check_azure_blob_file, which validates
file metadata. The SQL engine was also generating checks for them, trying to open
blobs as DuckDB tables over Azure—which failed with missing_env_DATACONTRACT_AZURE_TENANT_ID
when no SPN credentials were set. Blob schemas require no SQL connection, only the
blob client. Added _is_azure_blob_schema helper to skip them in create_checks.

Extended end-to-end wiring test to catch this regression: assert no ibis checks
are generated for a blob contract and the run passes without credentials.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Comment thread datacontract/engines/datacontract/check_azure_blob_file.py Outdated
@jschoedl jschoedl merged commit 547faa1 into datacontract:main Jun 18, 2026
15 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

ODCS test image or binaries on the top of a local / cloud blob storage, adlsgen2, aws S3 and GCP storage

6 participants