Simulator Usage

Source file information

The Makefile is currently configured for the PolyGraph model with asynchronous updates, priority vertex scheduling, and graph slicing.
To run the simulator, first build DRAMSim2 with make libdramsim.so, then run export LD_LIBRARY_PATH=path-to-dramsim2-folder/:$LD_LIBRARY_PATH (you may add this line to your bashrc).
Let's try a simple example using the cora dataset in the sample_datasets folder (format: src dst):

make sim-polygraph csr_file=\\\"/home/vidushi/graphsim-simulator/sample_datasets/cora_csr\\\"  V=2708 E=10556
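
The V and E arguments have to match the dataset. As a minimal sketch, assuming the file holds one "src dst" pair per line and that vertex IDs are zero-based and contiguous (an assumption, not something the simulator documents here), they could be derived as follows. The helper name count_graph is hypothetical and not part of the simulator.

```python
from typing import Iterable, Tuple


def count_graph(lines: Iterable[str]) -> Tuple[int, int]:
    """Return (V, E) for an edge list with one 'src dst' pair per line."""
    max_id = -1
    edges = 0
    for line in lines:
        fields = line.split()
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        src, dst = int(fields[0]), int(fields[1])
        max_id = max(max_id, src, dst)
        edges += 1
    # Assumption: IDs are zero-based and contiguous, so V = largest ID + 1.
    return max_id + 1, edges


# Tiny in-memory example; for a real file, pass open(path) instead.
sample = ["0 1", "1 2", "2 0", "", "3 1"]
v, e = count_graph(sample)
print(f"V={v} E={e}")
```

If the dataset uses one-based or sparse IDs, the V computed this way would need adjusting.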

Other variants are described below.

PolyGraph with a 4 MB (4096 kB) cache, for uniform-degree graphs:

make sim-polygraph csr_file=\\\"/home/vidushi/graphsim-simulator/sample_datasets/cora_csr\\\"  V=2708 E=10556 WORKING_CACHE=1 L2SIZE=4096
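
When the command is launched from a script rather than typed into a shell, the quoting around csr_file needs care: after the shell processes \\\" the argument that make receives contains \" around the path. A minimal sketch of building that argument list, with build_make_args as a hypothetical helper and /data/cora_csr as a placeholder path:

```python
import shlex
from typing import Dict, List, Optional


def build_make_args(target: str, csr_path: str, num_vertices: int,
                    num_edges: int,
                    knobs: Optional[Dict[str, int]] = None) -> List[str]:
    """Build an argv list equivalent to the shell command shown above."""
    args = [
        "make",
        target,
        # The literal backslash-quote pairs are what make sees once the
        # shell has processed \\\" in the hand-typed command.
        f'csr_file=\\"{csr_path}\\"',
        f"V={num_vertices}",
        f"E={num_edges}",
    ]
    for name, value in (knobs or {}).items():
        args.append(f"{name}={value}")
    return args


args = build_make_args("sim-polygraph", "/data/cora_csr", 2708, 10556,
                       {"WORKING_CACHE": 1, "L2SIZE": 4096})
# shlex.join shows a shell-safe rendering of the same argument list.
print(shlex.join(args))
```

The list can be handed to subprocess.run(args, check=True) without shell=True, with LD_LIBRARY_PATH set in the environment as described earlier.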

Note: The heuristic graph is not yet properly encoded in the simulator, so the programmer must specify which variant to use and when to switch dynamically. The latter is somewhat complex; please contact the authors for more details.

You should see output like this:

Cycles: 861
Barrier cycles: 42
Total cycles: 903
L2 hit rate: 0.984996
L2 accesses/edges: 11330 and size: 4096 kB
tasks created online: 5414
Time: 861 ns
GTEPS: 0
Real GTEPS: 12.2602
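
For scripted experiments it can help to turn this output into a dictionary. The following is a minimal sketch, assuming the output keeps the "name: number" shape shown above; parse_stats is a hypothetical helper and only the first number after the first colon on each line is kept.

```python
import re
from typing import Dict

SAMPLE = """\
Cycles: 861
Barrier cycles: 42
Total cycles: 903
L2 hit rate: 0.984996
L2 accesses/edges: 11330 and size: 4096 kB
tasks created online: 5414
Time: 861 ns
GTEPS: 0
Real GTEPS: 12.2602
"""

# Statistic name, a colon, then the first numeric value on the line.
LINE = re.compile(r"^([^:]+):\s*([-+]?\d+(?:\.\d+)?)")


def parse_stats(text: str) -> Dict[str, float]:
    """Map each statistic name to its first numeric value."""
    stats: Dict[str, float] = {}
    for line in text.splitlines():
        match = LINE.match(line.strip())
        if match:
            stats[match.group(1).strip()] = float(match.group(2))
    return stats


stats = parse_stats(SAMPLE)
# Sanity check: total cycles should equal computation plus barrier cycles.
print(stats["Total cycles"] == stats["Cycles"] + stats["Barrier cycles"])
```

Units and trailing fields (such as "ns" or the cache size on the L2 accesses line) are dropped by this sketch.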

Statistics

Here are descriptions of the most commonly used statistics.

Cycles = cycles taken by the algorithm
Barrier cycles = cycles spent switching slices (see Figure 8)
Total cycles = sum of the computation cycles and the barrier cycles
GTEPS = giga-edges traversed per second (assuming a frequency of 1 GHz)
Real GTEPS = GTEPS normalized to the work-efficiency of the algorithm
Local/remote updates = number of atomic update tasks (Figure 7) that were local/remote to the access-task core
Load per core = number of access-vertex tasks executed on a core
Exact hit rate = number of cache hits divided by the total number of cache accesses
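
The arithmetic behind these statistics can be checked against the cora sample run (V=2708, E=10556, 861 cycles, 42 barrier cycles). This is a minimal sketch, not the simulator's code: the function names are hypothetical, the hit count of 11160 is inferred from the reported hit rate and access count, and the observation that Real GTEPS equals the graph's edge count divided by the run time is read off the sample numbers rather than confirmed by the source.

```python
def total_cycles(cycles: int, barrier_cycles: int) -> int:
    """Computation cycles plus slice-switching (barrier) cycles."""
    return cycles + barrier_cycles


def gteps(edges: int, cycles: int, freq_ghz: float = 1.0) -> float:
    """Giga-edges per second; at 1 GHz one cycle takes one nanosecond."""
    time_ns = cycles / freq_ghz
    return edges / time_ns  # edges per ns equals giga-edges per second


def hit_rate(hits: int, accesses: int) -> float:
    """Cache hits divided by total cache accesses."""
    return hits / accesses


print(total_cycles(861, 42))             # 903
print(round(gteps(10556, 861), 4))       # 12.2602
print(round(hit_rate(11160, 11330), 6))  # 0.984996 (hit count is inferred)
```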

Simulator knobs

Below are the most common knobs used to try different algorithm variants:

Update-visibility = for async, use sim-polygraph; for sync-slice, use sim-polygraph with ABCD=1; for graph-sync, use the sim-graphmat mode
FIFO = round-robin or priority vertex scheduling
WORKING_CACHE = whether to use non-sliced or sliced execution
SLICE_COUNT = number of slices for slice scheduling
PULL = pull or push (push by default)
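
The update-visibility choices above map onto make targets and flags, which a small lookup table can capture. A minimal sketch, assuming only the targets and the ABCD flag named in the list; variant_command is a hypothetical helper, the SLICE_COUNT value is illustrative, and the csr_file, V, and E arguments are omitted for brevity.

```python
from typing import Dict, Optional, Tuple

# update-visibility variant -> (make target, knobs it requires)
VARIANTS: Dict[str, Tuple[str, Dict[str, int]]] = {
    "async": ("sim-polygraph", {}),
    "sync-slice": ("sim-polygraph", {"ABCD": 1}),
    "graph-sync": ("sim-graphmat", {}),
}


def variant_command(update_visibility: str,
                    extra_knobs: Optional[Dict[str, int]] = None) -> str:
    """Return the make invocation for a variant plus any extra knobs."""
    target, knobs = VARIANTS[update_visibility]
    merged = {**knobs, **(extra_knobs or {})}
    parts = ["make", target] + [f"{k}={v}" for k, v in merged.items()]
    return " ".join(parts)


print(variant_command("sync-slice", {"SLICE_COUNT": 4}))
# make sim-polygraph ABCD=1 SLICE_COUNT=4
```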

These are the knobs provided by the simulator to change the studied algorithm or evaluated architecture (all declared in config.hh).

Feature	Allowed input dimensions	Representative
Graph_shape	low diameter, high diameter	road, amazon, or synthetic
GP-size	fits in memory, does not fit	2MB, 2GB, 4GB
algo_order	order-sensitive (sp, pr, cf, astar) or not (bfs, cc, gcn+ladies sample)	sp, bfs, pr
Frontier	frontier (sssp, bfs, cc), non-frontier (pr, cf, gcn)	sp, bfs, pr

Feature	Allowed architecture dimensions	Idealized dimension
net_topology	mesh, crossbar, hrc_xbar	xbar
net_traffic_type	Basic_net, decomposable, real_multicast, path_multicast, ideal_net	infinite bandwidth
cache_repl_type	lru, phi, allhits, allmiss	all hits
mem_type	Dram_mem, ideal_mem	infinite memory bandwidth
spatial_part_type	random_spatial, linear, modulo, dfs_map, bfs_map, bdfs, bbfs, blocked_dfs, noload, remapping, automatic	always local mapping
work_dist_type	addr_map, work_steal	single core with combined compute bandwidth (all can fwd?)
temporal_sched	prio, non-prio	prio-inf-length

Feature	Allowed programming dimensions (assuming flexible architecture)	Take max of these?
task_sched_type	datadep, fifo, abcd, vertex_id	(independent of architecture??)
slice_sched_type	roundrobin, locality, priority
update_visibility	synch, async-coarse, asyncfine, dyn_graph fine-sync (dijkstra?), speculative
dyn_algo_type	inc, recomp
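
Among the spatial_part_type options, linear and modulo are names commonly used for two simple ways of assigning vertices to banks or cores. The sketch below shows what such mappings typically look like. It assumes, without confirmation from the simulator source, that modulo interleaves consecutive vertex IDs across banks while linear gives each bank one contiguous block; both function names are hypothetical.

```python
from typing import List


def modulo_bank(vertex: int, num_banks: int) -> int:
    """Interleave consecutive vertex IDs across banks."""
    return vertex % num_banks


def linear_bank(vertex: int, num_vertices: int, num_banks: int) -> int:
    """Assign contiguous blocks of vertex IDs to each bank."""
    per_bank = -(-num_vertices // num_banks)  # ceiling division
    return vertex // per_bank


num_vertices, num_banks = 8, 4
modulo_map: List[int] = [modulo_bank(v, num_banks) for v in range(num_vertices)]
linear_map: List[int] = [linear_bank(v, num_vertices, num_banks)
                         for v in range(num_vertices)]
print(modulo_map)  # [0, 1, 2, 3, 0, 1, 2, 3]
print(linear_map)  # [0, 0, 1, 1, 2, 2, 3, 3]
```

Interleaving tends to balance load when neighboring IDs are equally active, while contiguous blocks keep neighboring IDs on the same bank.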

Graphsim components

completion buffer: This class implements a reorder buffer for requests to main memory.
memory controller: This class sends requests to, and receives responses from, the caches or main memory.
task controller: This class implements the vertex scheduling algorithm variant for asynchronous algorithms.
scratch controller: This class implements spatial partitioning of the graph across scratchpad banks. It also handles atomic updates to these banks.
asic: This class implements the control core, which manages the slice scheduling variant and data orchestration during slice switching (see Figure 8 in the PolyGraph paper, https://ieeexplore.ieee.org/document/9499835).
asic core: This class implements the datapath of the vertex processing algorithm. It implements both the pull/push and update scheduling variants. The stages in the datapath are: Prefetch source vertex prop -> Task queue dispatch -> Prefetch edge -> Process edge -> Reduce -> Atomic update -> Dynamic task creation (apply) -> Aggregation buffer stage -> Push to task queue
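
To illustrate the reorder-buffer idea behind the completion buffer, here is a minimal sketch of the general technique. It is not the simulator's class: ReorderBuffer and its methods are hypothetical names. Requests receive increasing IDs when issued, responses may arrive in any order, and results are released strictly in issue order.

```python
from typing import Dict, List


class ReorderBuffer:
    """Release out-of-order completions in the order they were issued."""

    def __init__(self) -> None:
        self._next_id = 0                # ID for the next issued request
        self._head = 0                   # oldest request not yet released
        self._done: Dict[int, str] = {}  # completed but unreleased payloads

    def issue(self) -> int:
        """Reserve a slot for a new request and return its ID."""
        request_id = self._next_id
        self._next_id += 1
        return request_id

    def complete(self, request_id: int, payload: str) -> None:
        """Record a response, which may arrive out of order."""
        self._done[request_id] = payload

    def drain(self) -> List[str]:
        """Release every payload that is next in issue order."""
        released = []
        while self._head in self._done:
            released.append(self._done.pop(self._head))
            self._head += 1
        return released


rob = ReorderBuffer()
a, b, c = rob.issue(), rob.issue(), rob.issue()
rob.complete(c, "C")
print(rob.drain())  # [] because request a is still outstanding
rob.complete(a, "A")
print(rob.drain())  # ['A']
rob.complete(b, "B")
print(rob.drain())  # ['B', 'C']
```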
