@jbr-hook/sparql-endpoint-comunica
TypeScript icon, indicating that this package has built-in type declarations

3.0.2 • Public • Published

JBR Hook - SPARQL Endpoint Comunica

Build status Coverage Status npm version

A jbr hook type for a Comunica-based SPARQL endpoint.

Requirements

  • Node.js (1.12 or higher)
  • Docker (required for starting a Comunica SPARQL endpoint inside a Docker container)
  • An existing jbr experiment that requires a SPARQL endpoint hook.

Configure an experiment hook

If an experiment requires a hook for a SPARQL endpoint, then you can install this Comunica-based SPARQL endpoint as follows.

$ jbr set-hook someHookSparqlEndpoint sparql-endpoint-comunica

Output

The following output is generated after an experiment with this hook has run.

output/stats-sparql-endpoint-comunica.csv: Per second of the experiment: CPU percentage, memory usage (bytes), memory percentage, received bytes, transmitted bytes.

cpu_percentage,memory,memory_percentage,received,transmitted
9.915362228116711,10489856,0.5024267940030527,488,0
9.863725050505051,17354752,0.8312308965993495,648,0
9.64850952141058,19116032,0.915589944401502,738,0
9.345685076142132,23072768,1.105103526208198,738,0
10.029959365079364,26759168,1.2816689750964243,738,0
10.25411566137566,30363648,1.45431074734269,738,0

output/logs/sparql-endpoint-comunica.txt: Logs of the Comunica SPARQL endpoint.

Configuration

When installing this hook, your configuration file (jbr-experiment.json) will contain the following:

...
  "someHookSparqlEndpoint": {
    "@id": "urn:jrb:bb:hookSparqlEndpoint",
    "@type": "HookSparqlEndpointComunica",
    "dockerfileClient": "input/dockerfiles/Dockerfile-client",
    "resourceConstraints": {
      "@type": "DockerResourceConstraints",
      "cpu_percentage": 90
    },
    "configClient": "input/config-client.json",
    "additionalBinds": [ ],
    "clientPort": 3001,
    "clientLogLevel": "info",
    "queryTimeout": 300,
    "maxMemory": 8192
  }
...

Any config changes require re-running the prepare step.

More background information on these config options can be found on https://comunica.dev/.

Configuration fields

  • dockerfileClient: Path to the Dockerfile of Comunica.
  • resourceConstraints: Resource constraints for the Docker container.
  • configClient: Path to the configuration file of a Comunica engine.
  • clientPort: HTTP port on which the SPARQL endpoint will be exposed.
  • clientLogLevel: Logging level for Comunica engine.
  • queryTimeout: Timeout in seconds for a single SPARQL query execution.
  • maxMemory: Maximum amount of Memory for the Comunica Node.js process in MB.
  • additionalBinds: The Local file bindings to the client dockerfile.

Querying local files

By default, the hook handler sparql-endpoint-comunica expects the dataset to be served through http. If you want to use a local file as source for the comunica endpoint you can bind a local dataset to comunica docker container. To do so, follow these steps:

First, use lerna-docker to create a docker image of engines/query-sparql-file. For this use the dockerfile in engines/query-sparql. Then generate an experiment and add the sparql-endpoint-comunica hook like explained above. In the generated experiment, you should change the client dockerfile:

my-experiment/
  input/
    dockerfiles
      Dockerfile-client

Change the default docker image to your newly created docker image. For example:

FROM comunica/query-sparql:v2.5.0 -> FROM comunica/query-sparql-file:v2.5.0

Furthermore, change the config-client.json and context-client.json

my-experiment/
  input/
    config-client.json
    context-client.json

In the config-client.json change the import of

ccqs:config/config-default.json

to

ccqs:config/config-file.json

Then in the context-client.json set sources to

"sources": [ "/tmp/dataset.nt" ]

This path should point to where you want your dataset to be available in the Docker image.

Finally, we want to bind our dataset to the sources path we've specified above. This is done in the jbr-experiment.json

  my-experiment/
    jbr-experiment.json

By default the entry for the hookSparqlEndpoint should look like this

  "hookSparqlEndpoint": {
    "@id": "urn:jbr:testerdetest:hookSparqlEndpoint",
    "@type": "HookSparqlEndpointComunica",
    "dockerfileClient": "input/dockerfiles/Dockerfile-client",
    "resourceConstraints": {
      "@type": "StaticDockerResourceConstraints",
      "cpu_percentage": 100
    },
    "configClient": "input/config-client.json",
    "contextClient": "input/context-client.json",
    "additionalBinds": [],
    "clientPort": 3001,
    "clientLogLevel": "info",
    "queryTimeout": 300,
    "maxMemory": 8192
  }

In this, we set

  "additionalBinds": ["/generated/dataset.nt:/tmp/dataset.nt"] 

The first part before the : denotes where our dataset is found locally, while the second part shows where the dataset will be available in the Docker image.

License

jbr.js is written by Ruben Taelman.

This code is copyrighted by Ghent University – imec and released under the MIT license.

Package Sidebar

Install

npm i @jbr-hook/sparql-endpoint-comunica

Weekly Downloads

0

Version

3.0.2

License

MIT

Unpacked Size

30 kB

Total Files

17

Last publish

Collaborators

  • rubensworks