Compute distributed RDF dataset statistics.
Compute distributed RDF dataset statistics.
VoID description of the given dataset
2. Class Usage Count Criterion
Count the usage of respective classes of a datase,
the filter rule that is used to analyze a triple is the
same as in the first criterion.
2. Class Usage Count Criterion
Count the usage of respective classes of a datase,
the filter rule that is used to analyze a triple is the
same as in the first criterion.
As an action a map is being created having class IRIs as
identifier and its respective usage count as value.
If a triple is conform to the filter rule the respective
value will be increased by one.
Filter rule : ?p=rdf:type && isIRI(?o)
Action : M[?o]++
DataSet of classes used in the dataset and their frequencies.
3. Classes Defined Criterion
Gets a set of classes that are defined within a
dataset this criterion is being used.
3. Classes Defined Criterion
Gets a set of classes that are defined within a
dataset this criterion is being used.
Usually in RDF/S and OWL a class can be defined by a triple
using the predicate rdf:type
and either rdfs:Class
or
owl:Class
as object.
The filter rule illustrates the condition used to analyze the triple.
If the triple is accepted by the rule, the IRI used as subject is added to the set of classes.
Filter rule : ?p=rdf:type && isIRI(?s) &&(?o=rdfs:Class||?o=owl:Class)
Action : S += ?s
DataSet of classes defined in the dataset.
16. Distinct entities
Count distinct entities of a dataset by filtering out all IRIs.
16. Distinct entities
Count distinct entities of a dataset by filtering out all IRIs.
Filter rule : S+=iris({?s,?p,?o})
Action : S
DataSet of distinct entities in the dataset.
Distinct Objects
Count distinct objects within triples.
Distinct Objects
Count distinct objects within triples.
Filter rule : isURI(?o)
Action : M[?o]++
DataSet of objects used in the dataset.
Distinct Subjects
Count distinct subject within triples.
Distinct Subjects
Count distinct subject within triples.
Filter rule : isURI(?s)
Action : M[?s]++
DataSet of subjects used in the dataset.
32. Object vocabularies
Compute object vocabularies/namespaces used through the dataset.
32. Object vocabularies
Compute object vocabularies/namespaces used through the dataset.
Filter rule : ns=ns(?o)
Action : M[ns]++
DataSet of distinct object vocabularies used in the dataset and their frequencies.
31. Predicate vocabularies
Compute predicate vocabularies/namespaces used through the dataset.
31. Predicate vocabularies
Compute predicate vocabularies/namespaces used through the dataset.
Filter rule : ns=ns(?p)
Action : M[ns]++
DataSet of distinct predicate vocabularies used in the dataset and their frequencies.
Properties Defined
Count the defined properties within triples.
Properties Defined
Count the defined properties within triples.
Filter rule : ?p=rdf:type && (?o=owl:ObjectProperty ||
?o=rdf:Property)&& !isIRI(?s)
Action : M[?p]++
DataSet of predicates defined in the dataset.
5. Property Usage Criterion
Count the usage of properties within triples.
5. Property Usage Criterion
Count the usage of properties within triples.
Therefore an DataSet will be created containing all property
IRI's as identifier.
Afterwards, their frequencies will be computed.
Filter rule : none
Action : M[?p]++
DataSet of predicates used in the dataset and their frequencies.
30. Subject vocabularies
Compute subject vocabularies/namespaces used through the dataset.
30. Subject vocabularies
Compute subject vocabularies/namespaces used through the dataset.
Filter rule : ns=ns(?s)
Action : M[ns]++
DataSet of distinct subject vocabularies used in the dataset and their frequencies.
1. Used Classes Criterion
Creates an DataSet of classes are in use by instances of the analyzed dataset.
1. Used Classes Criterion
Creates an DataSet of classes are in use by instances of the analyzed dataset.
As an example of such a triple that will be accepted by
the filter is sda:Gezim rdf:type distLODStats:Developer
.
Filter rule : ?p=rdf:type && isIRI(?o)
Action : S += ?o
DataSet of classes/instances