Author: Roman Dementiev (2007)

This page gives an example of how an application using "traditional" containers and algorithms can be converted into using pipelines streaming.

The purpose of our example is to generate a huge random directed graph in a sorted edge array representation, i.e. the edges in the edge array must be sorted lexicographically. The definitions of the classes edge, random_edge and edge_cmp are in the following code listing.

struct edge { // edge class
    int src, dst; // nodes
    edge() {}
    edge(int src_, int dst_): src(src_), dst(dst_) {}
    bool operator == (const edge & b) const {
        return src == b.src && dst == b.dst;
    }
};
struct random_edge { // random edge generator functor
    edge operator () () const {
        edge Edge(random() - 1, random() - 1);
        while(Edge.dst == Edge.src)
             Edge.dst = random() - 1 ; // no self-loops
        return Edge;
    }
};
struct edge_cmp { // edge comparison functor
    edge min_value() const {
        return edge(std::numeric_limits<int>::min(),0); };
    edge max_value() const {
        return edge(std::numeric_limits<int>::max(),0); };
    bool operator () (const edge & a,
                      const edge & b) const {
        return a.src < b.src || (a.src == b.src && a.dst < b.dst);
    }
};

A straightforward procedure to generate the graph is to: 1) generate a sequence of random edges, 2) sort the sequence, 3) remove duplicate edges from it. If we ignore definitions of helper classes the STL/STXXL code of the algorithm implementation is only five lines long:

// create vector
stxxl::vector<edge> ExtEdgeVec(10000000000ULL);
// generate random edges
stxxl::generate(ExtEdgeVec.begin(), ExtEdgeVec.end(), random_edge());
// sort edges by source vertex
stxxl::sort(ExtEdgeVec.begin(), ExtEdgeVec.end(), edge_cmp(), 512*1024*1024);
// unify equal edges
stxxl::vector<edge>::iterator NewEnd =  std::unique(ExtEdgeVec.begin(), ExtEdgeVec.end());
ExtEdgeVec.resize(NewEnd - ExtEdgeVec.begin());

Line 2 creates an STXXL external memory vector with 10 billion edges. Line 4 fills the vector with random edges (stxxl::generate from the STL is used). In the next line the STXXL external memory sorter sorts randomly generated edges using 512 megabytes of internal memory. The lexicographical order is defined by functor my_cmp, stxxl::sort also requires the comparison functor to provide upper and lower bounds for the elements being sorted. Line 8 deletes duplicate edges in the external memory vector with the help of the STL std::unique algorithm. The NewEnd vector iterator points to the right boundary of the range without duplicates. Finally (in the last line), we chop the vector at the NewEnd boundary.

Now we count the number of I/Os performed by this example: external vector construction takes no I/Os; filling with random values requires a scan — $ N/DB $ I/Os; sorting will take $ 4N/DB $ I/Os; duplicate removal needs no more than $ 2N/DB $ I/Os; chopping a vector is I/O-free. The total number of I/Os is $ 7N/DB $ .

Pipelined random graph generation

Now we "pipeline" the random graph generation example shown in the previous chapter. The data flow graph of the algorithm is presented in the following figure:

Pipeline of Random Graph Generator

The following code listing shows the pipelined code of the algorithm, the definitions of edge, random_edge, and edge_cmp are assumed to be available from the listing in the previous section.

class random_edge_stream {
    stxxl::int64 counter;
    edge current;
    random_edge_stream();
public:
    typedef edge value_type;
    random_edge_stream(stxxl::int64 elements)
        : counter(elements), current(random_edge()())
    { }
    const edge & operator * () const { return current; }
    const edge * operator ->() const { return &current; }
    random_edge_stream & operator ++ ()
    {
        --counter;
        current = random_edge()();
        return *this;
    }
    bool empty() const { return counter == 0; }
};
random_edge_stream RandomStream(10000000000ULL);
typedef stxxl::stream::sort<random_edge_stream, edge_cmp> sorted_stream;
sorted_stream SortedStream(RandomStream, edge_cmp(), 512*1024*1024);
typedef stxxl::stream::unique<sorted_stream> unique_stream_type;
unique_stream_type UniqueStream(SortedStream);
stxxl::vector<edge> ExtEdgeVec(10000000000ULL);
stxxl::vector<edge>::iterator NewEnd =
    stxxl::stream::materialize(UniqueStream, ExtEdgeVec.begin());
ExtEdgeVec.resize(NewEnd - ExtEdgeVec.begin());

Since the sorter of the streaming layer accepts an stream input, we do not need to output the random edges. Rather, we generate them on the fly. The random_edge_stream object (model of stream) supplies the sorter with a stream of random edges. We define the type of the sorter node as sorted_stream; it is parameterized by the type of the input stream and the type of the comparison function object. Then a SortedStream object is created and its input is attached to the RandomStream object's output. The internal memory consumption of the sorter stream object is limited to 512 MB. The UniqueStream object filters the duplicates in its input edge stream. The generic stream::unique stream class stems from the STXXL library. The stream::materialize function records the content of the UniqueStream into the external memory vector. As in the previous non-pipelined version, we cut the vector at the NewEnd boundary.

Let us count the number of I/Os the program performs: random edge generation by RandomStream costs no I/O; sorting in SortedStream needs to store the sorted runs and read them again to merge — $ 2N/DB $ I/Os; UniqueStream deletes duplicates on the fly, it does not need any I/O; and materializing the final output can cost up to $ N/DB $ I/Os. All in all, the program only incurs $ 3N/DB $ I/Os, compared to $ 7N/DB $ for the non-pipelined code.