Semantic Analytics Stack (SANSA)

Open Source Algorithms for Distributed Data Processing for Large-scale RDF Knowledge Graphs

At its core, SANSA-Stack is a data flow processing engine that provides data distribution and fault tolerance for distributed computations over large-scale RDF datasets.

SANSA includes several libraries for creating applications:

  1. Read/Write library for RDF/OWL operations,
  2. Querying library that supports a query language on top of the distributed RDF/OWL library,
  3. Inference library that implements rule-based reasoning on RDF/OWL data,
  4. ML (Machine Learning) core library.
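
To make the division of labour between the read, query and inference layers concrete, here is a toy single-machine sketch in plain Python. It does not use SANSA, Spark or Flink, and none of the names in it (read_triples, query, infer_types, the ex: sample data) are SANSA APIs; they are hypothetical helpers chosen for illustration. The only rule applied is the RDFS-style type propagation along rdfs:subClassOf, standing in for rule-based reasoning in general.

```python
# Toy, single-machine illustration of the read / query / inference layers.
# Assumption: every name below is a hypothetical helper, not a SANSA API.

RDF_TYPE = "rdf:type"
RDFS_SUBCLASS_OF = "rdfs:subClassOf"


def read_triples(text):
    """Parse whitespace-separated 'subject predicate object .' lines."""
    triples = set()
    for line in text.strip().splitlines():
        parts = line.split()
        if len(parts) >= 3:
            triples.add((parts[0], parts[1], parts[2]))
    return triples


def query(triples, s=None, p=None, o=None):
    """Return the triples matching a pattern; None acts as a wildcard."""
    return {
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    }


def infer_types(triples):
    """Apply one rule until nothing new is derived:
    (x rdf:type C) and (C rdfs:subClassOf D) => (x rdf:type D)."""
    inferred = set(triples)
    changed = True
    while changed:
        changed = False
        subclass = query(inferred, p=RDFS_SUBCLASS_OF)
        # query() returns a fresh set, so adding to `inferred` here is safe.
        for (x, _, c) in query(inferred, p=RDF_TYPE):
            for (c2, _, d) in subclass:
                if c == c2 and (x, RDF_TYPE, d) not in inferred:
                    inferred.add((x, RDF_TYPE, d))
                    changed = True
    return inferred


data = """
ex:alice rdf:type ex:Student .
ex:Student rdfs:subClassOf ex:Person .
ex:Person rdfs:subClassOf ex:Agent .
"""
graph = infer_types(read_triples(data))
alice_types = sorted(o for (_, _, o) in query(graph, s="ex:alice", p=RDF_TYPE))
print(alice_types)  # ['ex:Agent', 'ex:Person', 'ex:Student']
```

In a distributed setting the same three steps would operate on partitioned collections of triples rather than on an in-memory set, which is where the data distribution and fault tolerance of the underlying engine come in.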

SANSA integrates easily with well-known open source systems for both data input and output (HDFS) and is built on top of Spark and Flink.



SANSA-Stack Architecture


SANSA is a research project of the Smart Data Analytics research group.

Getting Started

Download the latest release and run SANSA on your machine, on a cluster, or in the cloud.

The setup guide gives an overview of all deployment options.

Community

If you have a question about SANSA, you can post it on one of the community channels.

Supported By

University of Bonn

InfAI

IAIS

BigDataEurope