This reference documentation details all available configuration options that can be specified in a collection’s configuration file to configure the Open Terms Archive engine.
As an example, see the production configuration file of the Demo collection.
trackingSchedule
string30 */12 * * *
(runs every 12 hours at minute 30)collectionPath
string./
../collections/demo-declarations
The recorder section manages how versions and snapshots of terms are stored, supporting multiple storage backends.
recorder.versions.storage
objectrecorder.snapshots.storage
objectThe fetcher section configures how the engine retrieves documents from the web.
fetcher.waitForElementsTimeout
number10000
fetcher.navigationTimeout
number30000
fetcher.language
stringen
The notifier section sets up how notifications are sent when new versions of terms are recorded.
notifier.sendInBlue.updatesListId
string850
notifier.sendInBlue.updateTemplateId
string7
The logger section configures logging and error notification settings.
logger.smtp.host
stringsmtp-relay.sendinblue.com
logger.smtp.username
stringadmin@opentermsarchive.org
logger.sendMailOnError.to
stringadmin@example.com
logger.sendMailOnError.from
stringnoreply@example.com
logger.sendMailOnError.sendWarnings
booleanfalse
logger.timestampPrefix
booleantrue
The reporter section manages how issues are reported when terms content is inaccessible, supporting GitHub and GitLab.
reporter.type
stringgithub
github
, gitlab
reporter.repositories.declarations
stringOpenTermsArchive/demo-declarations
reporter.repositories.versions
stringOpenTermsArchive/demo-versions
reporter.repositories.snapshots
stringOpenTermsArchive/demo-snapshots
reporter.baseURL
stringhttps://gitlab.example.com
reporter.apiBaseURL
stringhttps://api.gitlab.example.com
The dataset section configures how datasets are published.
dataset.title
stringsandbox
dataset.versionsRepositoryURL
stringhttps://github.com/OpenTermsArchive/sandbox
dataset.publishingSchedule
string30 8 * * MON
(runs every Monday at 8:30 AM)The collection API section sets the parameters for the API server.
collection-api.api.port
number
Required8080
collection-api.api.basePath
string
Required/collection-api
The storage repositories section set the parameters for supported backends for storing versions and snapshots, supporting Git and MongoDB.
storage.type
stringgit
git
, mongo
The Git storage configuration allows to store versions in a Git repository.
storage.git.path
string./data/versions
storage.git.publish
booleanfalse
storage.git.snapshotIdentiferTemplate
string%SNAPSHOT_ID
will be replaced with the actual snapshot ID../data/snapshots/%SNAPSHOT_ID
storage.git.author.name
stringOpen Terms Archive Bot
storage.git.author.email
stringbot@opentermsarchive.org
The MongoDB storage configuration allows to store versions in a MongoDB database.
storage.mongo.connectionURI
stringmongodb://127.0.0.1:27017
storage.mongo.database
stringopen-terms-archive
storage.mongo.collection
stringsnapshots