Skip to content

Added initial version of AGENTS.md#14

Open
alexott wants to merge 2 commits into
databricks-industry-solutions:mainfrom
alexott:agents-md-first-version
Open

Added initial version of AGENTS.md#14
alexott wants to merge 2 commits into
databricks-industry-solutions:mainfrom
alexott:agents-md-first-version

Conversation

@alexott

@alexott alexott commented Jan 2, 2026

Copy link
Copy Markdown
Collaborator

This is modified version of my own agents file, works quite well on my projects.

This is modified version of my own agents file, works quite well on my projects.
@alexott alexott requested a review from dmoore247 January 2, 2026 13:35
@dmoore247

Copy link
Copy Markdown
Collaborator

@alexott I love the theme here. The hold up on the merge is the automated build scripts are failing.

@dmoore247 dmoore247 left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@alexott

alexott commented Apr 4, 2026

Copy link
Copy Markdown
Collaborator Author

Maybe we don't even need it - I already submitted it to the ai-dev-kit: https://github.com/databricks-solutions/ai-dev-kit/tree/main/databricks-skills/spark-python-data-source

@dmoore247

Copy link
Copy Markdown
Collaborator

@alexott how about we add the link to the ai-dev-kit in the agents.md.
As discussed, we'll merge this after @ghanse restructuring PR gets merged.

@CLAassistant

Copy link
Copy Markdown

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers have signed the CLA.

✅ alexott
❌ ghanse
You have signed the CLA already but the status is still pending? Let us recheck it.

@ghanse ghanse left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Left a few comments

Comment thread AGENTS.md
Comment on lines +91 to +107
Follow this checklist (use existing sources as templates):

1. Create a new folder with the project skeleton.
2. Create new file `src/python_datasource_connectors/YourSource.py`
3. Implement `YourSourceDataSource(DataSource)` with `name()`, `writer()`, `streamWriter()`
4. Implement base writer class with:
- Options validation in `__init__`
- `write(iterator)` method with write logic
5. Implement batch and stream writer classes (minimal boilerplate)
6. Implement base reader class with:
- Options validation in `__init__`
- `read(partition)` method with read logic
- `partitions(start, end)` method to split data into partitions
7. Implement batch and stream writer classes (minimal boilerplate)
8. Add exports to `python_datasource_connectors/__init__.py`
9. Create test file `tests/test_yoursource.py` with unit tests
10. Update `README.md` with usage examples and options

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the example paths in this section need to be updated after the changes we made to the repo structure?

Comment thread AGENTS.md
### Implementation Rules

1. **One concept per file**: Each module should have a single, clear purpose
2. **Functions over classes**: Prefer functions unless you need state management

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will this work given that each custom source needs to extend the base data source classes?

Comment thread AGENTS.md
Comment on lines +194 to +225
### Testing
```bash
# Run all tests
poetry run pytest

# Run specific test file
poetry run pytest tests/test_ds.py

# Run single test
poetry run pytest tests/test_ds.py::TestXxxxDataSource::test_name

# Run with verbose output
poetry run pytest -v
```

### Building
```bash
# Build wheel package
poetry build

# Output will be in dist/ directory
```

### Code Quality
```bash
# Format and lint code (ruff)
poetry run ruff check cyber_connectors/
poetry run ruff format cyber_connectors/

# Type checking
poetry run mypy cyber_connectors/
```

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe we should update to the makefile commands?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants