This repository contains a C++ template library that essentially counts sufficient statistics on binary arrays. Its primary goal is to provide a general framework for building discrete exponential-family models. A particular example is Exponential Random Graph Models (ERGMs), but we can use barry
to deal with non-square arrays.
Among the key features included in barry
, we have:
- Sparse arrays.
- User-defined count statistics.
- User-defined constrain of the support set.
- Powerset generation of binary arrays.
- Discrete Exponential Family Models module (DEFMs).
- Pooled DEFMs.
To use barry, you can either download the entire repository or, since it is header-only, the single header version barry.hpp
.
This library was created and maintained by Dr. George G. Vega Yon as part of his doctoral dissertation "Essays on Bioinformatics and Social Network Analysis: Statistical and Computational Methods for Complex Systems."
In the following code we create an array of size 5x5 of class Network
(available in the namespace netcounters), add/remove ties, print the
graph, and count common statistics used in ERGMs:
#include <iostream>
#include <ostream>
#include "../include/barry.hpp"
typedef std::vector< unsigned int > vuint;
int main() {
// Creating network of size six with five ties
netcounters::Network net(
6, 6,
{0, 0, 4, 4, 2, 0, 1},
{1, 2, 0, 2, 4, 0, 1}
);
// How does this looks like?
net.print("Current view");
// Adding extra ties
net += {1, 0};
net(2, 0) = true;
// And removing a couple
net(0, 0) = false;
net -= {1, 1};
net.print("New view");
// Initializing the data. The program deals with freing the memory
net.set_data(new netcounters::NetworkData, true);
// Creating counter object for the network and adding stats to count
netcounters::NetStatsCounter counter(&net);
netcounters::counter_edges(counter.counters);
netcounters::counter_ttriads(counter.counters);
netcounters::counter_isolates(counter.counters);
netcounters::counter_ctriads(counter.counters);
netcounters::counter_mutual(counter.counters);
// Counting and printing the results
std::vector< double > counts = counter.count_all();
std::cout <<
"Edges : " << counts[0] << std::endl <<
"Transitive triads : " << counts[1] << std::endl <<
"Isolates : " << counts[2] << std::endl <<
"C triads : " << counts[3] << std::endl <<
"Mutuals : " << counts[4] << std::endl;
return 0;
}
Compiling this program using g++
g++ -std=c++11 -Wall -pedantic 08-counts.cpp -o counts && ./counts
Yields the following output:
Current view
[ 0,] 1 1 1 . . .
[ 1,] . 1 . . . .
[ 2,] . . . . 1 .
[ 3,] . . . . . .
[ 4,] 1 . 1 . . .
[ 5,] . . . . . .
New view
[ 0,] . 1 1 . . .
[ 1,] 1 . . . . .
[ 2,] 1 . . . 1 .
[ 3,] . . . . . .
[ 4,] 1 . 1 . . .
[ 5,] . . . . . .
Edges : 7
Transitive triads : 3
Isolates : 2
C triads : 1
Mutuals : 3
One of the key features of barry
is that it will handle memory efficiently. In the case of pooled-data models, the module for statistical models avoids double-counting support when possible by keeping track of what datasets (networks, for instance) share the same.
More information can be found in the Doxygen website here and in the PDF version of the documentation here.
Please note that the barry
project is released with a
Contributor Code of Conduct.
By contributing to this project, you agree to abide by its terms.