Developer Guide#
Setting uo the Development Environment#
Setting up the development environment is a critical step in contributing to TEEHR. This guide will walk you through the steps to set up a development environment for TEEHR.
Install the prerequisites: - Python 3.10 or later - Poetry - Java 11 or later for Spark (we use 17)
1. Clone the TEEHR repository from GitHub:
`bash
git clone https://github.com/RTIInternational/teehr.git
`
2. Navigate to the TEEHR directory:
`bash
cd teehr
`
3. Create a new virtual environment using poetry:
`bash
poetry install
`
4. Activate the virtual environment:
`bash
poetry shell
`
5. Install the required JAR files for Spark:
`bash
python download_spark_jars.py
`
Contributing Guidelines#
These contributing guidelines will be updated as we progress. They are pretty slim to start.
TEEHR has multiple parts, one is a library of reusable code that can be imported as a dependency to another project, another is examples and dashboards which are more use case specific (e.g., a dashboard to conduct post event analysis). The guidelines for contributing may be a bit different.
Library Code#
Git LFS#
Use git lfs for large files. Even better keep large files out of the repo.
Notebooks#
Do not commit notebook output to the repo. Use can install and use nbstripout to strip output. After cloning, you must run nbstripout –install.
nbstripoutput is configured to strip output from notebooks to keep the size down and make diffing files easier. See kynan/nbstripout. The configuration is stored in the .gitattributes file, but the tool must be installed per repo. You may need to install the Python package first with conda install nbstripout or similar depending on your environment.
Local Development#
The most common way to use TEEHR is by installing it in a Python virtual
environment. The document covers using a conda virtual environment, but
there is no hard requirement to do so. In this case the packages are not
installed, so you need to make sure you add src/
to your Python path.
There are two way to do this below, but depending on your development
environment, your milage may vary.
TODO: Poetry docs
Release Process#
This document describes the release process which has some manual steps to complete.
Create branch with the following updated to the new version (find and replace version number):
version.txt
README.md
pyproject.toml
src/teehr/__init__.py
docs/sphinx/getting_started/index.rst
Update the changelog at docs/sphinx/changelog/index.rst
to reflect the changes included in the release.
If also pushing changes to TEEHR-HUB, also update tags in teehr-hub/helm-chart/config.yaml
.
Make a PR to main. After PR has been reviewed and merged, checkout main
pull changes and tag the commit.
git checkout main
git pull
git tag -a v0.x.x -m "version 0.x.x"
git push origin v0.x.x
Tagging will trigger a docker container build and push to the AWS registry for deployment to TEEHR-HUB. Deployment to TEEHR-HUB is a manual process that requires the correct credentials.
Contributing to the Documentation#
description
docstring approach (numpy)
pre-commit validation
building and pushing docs
The documentation files are in the docs/sphinx
directory.
To build the documentation html files, navigate to docs/sphinx
and run:
make clean html
Check your files locally in a browser such as Firefox:
firefox _build/html/index.html &
Some pre-commit hooks are configured automatically run when you commit some code. These check for things like large files, docstring formatting, added whitespace, etc. To run these manually and print the results to a text file pre-commit-output.txt, run:
pre-commit run --all-files > pre-commit-output.txt