The Geoconnex scheduler crawls water data metadata on a schedule and synchronizes it with a graph database.
- It crawls sitemaps with `nabu harvest` and downloads the results to an S3 bucket
- It syncs data between the S3 bucket and the graph database using `nabu sync`
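The two-step flow above can be sketched as a small wrapper. This is illustrative only: the scheduler's real invocations (flags, arguments, and how Dagster wires the steps together) are not shown in this README, so `run_pipeline` and the bare subcommands are assumptions.

```python
import subprocess

def run_pipeline(runner=subprocess.run):
    """Hypothetical sketch: crawl sitemaps into S3, then sync S3 into the graph DB."""
    # `nabu harvest`: crawl sitemaps and download metadata to the S3 bucket
    runner(["nabu", "harvest"], check=True)
    # `nabu sync`: reconcile the S3 bucket with the graph database
    runner(["nabu", "sync"], check=True)
```

The `runner` parameter exists only so the flow can be exercised without the real `nabu` binary installed.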
For more information about the Geoconnex project generally and how it aims to improve water data infrastructure, see the Geoconnex docs.
> **Important:** You must have `uv` installed for package management.
Install dependencies and spin up the necessary Docker services:

```shell
make deps && make dev
```

Then go to `localhost:3000`.
- `make prod` — spin up all services as containers, including user code and local db/s3 containers (make sure to set the `DAGSTER_POSTGRES_HOST` env var to `dagster_postgres`)
- `make cloudProd` — spin up user code and essential services but not storage (you will need to specify your db/s3 endpoints and any other remote services in the `.env` file)

All cloud deployment and infrastructure-as-code work is contained within the harvest.geoconnex.us repo.
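For the `make prod` target, the containerized Postgres instance must be reachable under its Docker network name. A minimal `.env` entry for that case (the value `dagster_postgres` comes from the note above; any other variables your deployment needs go in the same file):

```
# .env (repo root)
DAGSTER_POSTGRES_HOST=dagster_postgres
```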
- All env vars must be defined in `.env` at the root of the repo
- The `.env.example` file will be copied to `.env` if it does not exist
- Spin up the local dev environment
- Run `make test` from the root to execute tests
- If you use VSCode, run the `dagster dev` task in the debug panel to run the full pipeline with the ability to set breakpoints
This repository is a heavily modified version of gleanerio/scheduler and is licensed under Apache 2.0.