Getting Started with SANSA-Stack

This document summarizes the instructions that help first-time users obtain and use SANSA-Stack.


Set up SANSA

To get started quickly, SANSA provides project templates for the following build tools: Maven and SBT.

Maven

  • Use this Maven template to generate a SANSA project using Apache Spark.

    The subsequent steps depend on your IDE. Generally, just import this repository as a Maven project and start using SANSA / Spark.

  • Use this Maven template to generate a SANSA project using Apache Flink.

    The subsequent steps depend on your IDE. Generally, just import this repository as a Maven project and start using SANSA / Flink.

SBT

  • Use this SBT template to generate a SANSA project using Apache Spark.

    The subsequent steps depend on your IDE. Generally, just import this repository as an SBT project and start using SANSA / Spark.

  • Use this SBT template to generate a SANSA project using Apache Flink.

    The subsequent steps depend on your IDE. Generally, just import this repository as an SBT project and start using SANSA / Flink.

These templates set up the project structure and create the initial build files. Enjoy! :)

SANSA-Notebooks

SANSA-Notebooks provides interactive Spark notebooks for running SANSA-Examples. The repository contains a docker-compose.yml for running a Hadoop/Spark cluster locally. The cluster also includes Hue for navigating HDFS and copying files to it. The notebooks are created and run using Apache Zeppelin.

Configuring the Computing Frameworks

Running SANSA on Apache Spark.

SANSA Version    Spark Version
v0.1.0           2.0.x
v0.2.0           2.1.x

Running SANSA on Apache Flink.

SANSA Version    Flink Version
v0.1.0           1.1.x
v0.2.0           1.3.x
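
The version pairs listed for Spark and Flink can also be checked programmatically, for example in a build or startup script. The following is a minimal sketch in Java and not part of SANSA itself: the class name SansaCompatibility and all of its methods are hypothetical, and the only data it contains are the version pairs listed in this section. It assumes Java 9 or later (for Map.of) and treats a trailing ".x" as "any patch release".

```java
import java.util.Map;
import java.util.Optional;

// Hypothetical helper, not a SANSA API: encodes the version pairs
// listed in this section and checks a framework version against them.
public class SansaCompatibility {

    // SANSA release -> supported Spark release line.
    private static final Map<String, String> SPARK = Map.of(
            "v0.1.0", "2.0.x",
            "v0.2.0", "2.1.x");

    // SANSA release -> supported Flink release line.
    private static final Map<String, String> FLINK = Map.of(
            "v0.1.0", "1.1.x",
            "v0.2.0", "1.3.x");

    public static Optional<String> sparkVersionFor(String sansaVersion) {
        return Optional.ofNullable(SPARK.get(sansaVersion));
    }

    public static Optional<String> flinkVersionFor(String sansaVersion) {
        return Optional.ofNullable(FLINK.get(sansaVersion));
    }

    // True if the concrete Spark version (e.g. "2.1.1") falls in the
    // release line listed for the given SANSA version.
    public static boolean isSparkCompatible(String sansaVersion, String sparkVersion) {
        return sparkVersionFor(sansaVersion)
                .map(pattern -> matches(pattern, sparkVersion))
                .orElse(false);
    }

    // True if the concrete Flink version (e.g. "1.3.2") falls in the
    // release line listed for the given SANSA version.
    public static boolean isFlinkCompatible(String sansaVersion, String flinkVersion) {
        return flinkVersionFor(sansaVersion)
                .map(pattern -> matches(pattern, flinkVersion))
                .orElse(false);
    }

    // "2.1.x" accepts every version that starts with "2.1.".
    private static boolean matches(String pattern, String actual) {
        String prefix = pattern.substring(0, pattern.length() - 1);
        return actual.startsWith(prefix);
    }

    public static void main(String[] args) {
        String sansa = args.length > 0 ? args[0] : "v0.2.0";
        System.out.println("SANSA " + sansa
                + " -> Spark " + sparkVersionFor(sansa).orElse("unknown")
                + ", Flink " + flinkVersionFor(sansa).orElse("unknown"));
    }
}
```

SANSA releases that are not listed above yield an empty Optional and are reported as incompatible, so the maps need to be extended when a new release is added.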

Using SANSA in Maven Projects

To write an application directly on top of SANSA, add the following dependencies to your pom.xml.

On Spark applications    On Flink applications