
add benchmarks #17

Open
ev-br opened this issue Mar 16, 2016 · 17 comments

ev-br (Owner) commented Mar 16, 2016

and actually compare to, say, dok_matrix.

pv commented Mar 28, 2016

Probably best to compare against pysparse.ll_mat

pv commented Mar 28, 2016

Also, memory usage would be important to understand and document

ev-br (Owner, Author) commented Mar 28, 2016

Very crude for now, just %timeit and %memit, using the example from the pysparse docs:
https://github.com/ev-br/sparr/blob/master/benchmarks.ipynb

Basically, this is now about 2x slower than ll_mat for single-element __setitem__, plus the Python looping overhead. The memory consumption is not great, but not terrible either, at least at this problem size.
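
For reference, a minimal sketch of what each single-element assignment amounts to on a std::map backend (the (row, col) pair key is an assumption; sparr's actual layout may differ):

```cpp
#include <map>
#include <utility>

// Hypothetical sketch of a map-backed single-element setitem.
// Each assignment is one O(log nnz) red-black tree lookup/insert,
// which is the operation the %timeit loop exercises per element.
using Key = std::pair<long, long>;   // (row, col) key -- an assumption
std::map<Key, double> data;

void set_item(long i, long j, double v) {
    data[Key{i, j}] = v;   // tree search, plus a node allocation on a miss
}
```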

pv commented Mar 28, 2016

Also see scipy/scipy#6004

pv commented Mar 28, 2016

The memory usage may be a non-trivial disadvantage for std::map
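
A rough sketch of why: each map entry lives in its own heap-allocated tree node. The exact layout is implementation-defined, so the numbers below are an estimate for a typical 64-bit implementation, not a guarantee:

```cpp
#include <cstdio>
#include <utility>

// Approximate layout of one std::map<std::pair<long, long>, double> node.
struct rb_node_estimate {
    void *parent, *left, *right;   // 3 tree pointers: 24 bytes
    int color;                     // 4 bytes + 4 bytes padding
    std::pair<long, long> key;     // 16 bytes
    double value;                  // 8 bytes
};  // ~56 bytes per stored nonzero, vs 8 bytes for the value itself

int main() {
    std::printf("estimated node size: %zu bytes\n", sizeof(rb_node_estimate));
}
```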

ev-br (Owner, Author) commented Mar 28, 2016

It could be, yes. Like I was saying over at the scipy issue, I wonder what the deal is with that Wikipedia link about map: it's being invoked by both sides of the argument.

ev-br (Owner, Author) commented Mar 28, 2016

On a more serious note, there's no question that a sorted vector of sorted vectors wins if there's an a priori idea of the number of rows/columns. Otherwise it's either reallocation, or back to essentially the same memory footprint as std::map, no?

maciejkula commented

Vec of vecs has a per-row overhead, whereas std::map has a per-entry overhead. This matters if you have a lot of entries.
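
In code, the tradeoff looks roughly like this (a sketch of the two layouts, not the scipy/scipy#6004 implementation):

```cpp
#include <map>
#include <utility>
#include <vector>

// LIL-style: one sorted (col, value) vector per row. The fixed cost is
// sizeof(std::vector) -- typically 24 bytes -- per ROW, however many
// entries the row holds.
using Row = std::vector<std::pair<long, double>>;
using LilStorage = std::vector<Row>;

// Map-style: one heap-allocated tree node per ENTRY, so the overhead
// (pointers, color, padding) scales with the number of nonzeros.
using MapStorage = std::map<std::pair<long, long>, double>;
```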

ev-br (Owner, Author) commented Mar 28, 2016

But you have to reallocate vectors if there's an insertion into a row which was previously empty and there were non-empty rows on both sides of it?

maciejkula commented

In scipy/scipy#6004 I pre-allocate all the row vectors, so I pay the per-vector overhead up-front.
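
That is, something along these lines (a sketch of the idea with hypothetical names, not the actual PR code):

```cpp
#include <cstddef>
#include <utility>
#include <vector>

using Row = std::vector<std::pair<long, double>>;

// Pay the per-row overhead once, at construction time: every row vector
// exists (empty) from the start, so a first insertion into any row never
// has to restructure the outer container.
struct PreallocLil {
    std::vector<Row> rows;
    explicit PreallocLil(std::size_t nrows) : rows(nrows) {}
};
```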

ev-br (Owner, Author) commented Mar 28, 2016

Makes sense --- so you require an a priori estimate of the number of rows?

maciejkula commented

I just realised you've been working on this recently; apologies for stepping on your toes! I scanned the scipy PRs so as to not duplicate work, but yours was separate :(

maciejkula commented

Yes. This can be relaxed with some work.

ev-br (Owner, Author) commented Mar 28, 2016

Re: toes: No problem whatsoever! Hope you don't feel I'm stepping on yours.

ev-br (Owner, Author) commented Mar 28, 2016

In your solution with preallocation: you can probably just have a second vector with row indices. The overhead is at most nrows, but there's no reallocation at all.
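
Something like the following sketch (hypothetical names; a binary search over the row-index vector locates a row):

```cpp
#include <algorithm>
#include <utility>
#include <vector>

using Row = std::vector<std::pair<long, double>>;

// Row-directory idea: store only the non-empty rows, plus a parallel
// sorted vector of their indices. The directory is at most nrows long,
// and rows that stay empty cost nothing.
struct RowDirLil {
    std::vector<long> row_index;   // sorted indices of non-empty rows
    std::vector<Row>  rows;        // rows[k] belongs to row_index[k]

    Row& get_row(long i) {
        auto it = std::lower_bound(row_index.begin(), row_index.end(), i);
        auto k = it - row_index.begin();
        if (it == row_index.end() || *it != i) {   // first touch of row i
            row_index.insert(it, i);
            rows.insert(rows.begin() + k, Row{});
        }
        return rows[k];
    }
};
```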

maciejkula commented

Yep, or have a resize call that allocates new vectors when necessary.
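
For the resize variant, a sketch (hypothetical; grows the outer vector lazily instead of preallocating it):

```cpp
#include <cstddef>
#include <utility>
#include <vector>

using Row = std::vector<std::pair<long, double>>;

// Grow the outer vector only when a row past the current bound is
// first touched; untouched trailing rows are never allocated.
struct GrowableLil {
    std::vector<Row> rows;
    Row& get_row(std::size_t i) {
        if (i >= rows.size())
            rows.resize(i + 1);   // creates the intervening empty rows
        return rows[i];
    }
};
```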

ev-br (Owner, Author) commented Apr 12, 2016

See #40 (comment) for some numbers on constructing a Poisson 2D matrix from the pysparse docs.

I've no idea why the memory benchmarks report all zeros. From peakmem, fast_lil_mat actually looks good :-). Timewise, ll_mat wins hands down.
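
For context, the workload here is the 2D Poisson matrix from the pysparse docs, i.e. the 5-point finite-difference stencil on an n x n grid. A self-contained sketch of that fill pattern against a map-backed matrix (the docs' exact loop may differ):

```cpp
#include <map>
#include <utility>

using Key = std::pair<long, long>;

// 5-point Poisson stencil: 4 on the diagonal, -1 for each grid neighbour.
void poisson2d(std::map<Key, double>& A, long n) {
    for (long i = 0; i < n; ++i)
        for (long j = 0; j < n; ++j) {
            long k = i * n + j;                    // grid point -> matrix row
            A[{k, k}] = 4.0;
            if (i > 0)     A[{k, k - n}] = -1.0;   // neighbour one row down
            if (i < n - 1) A[{k, k + n}] = -1.0;   // neighbour one row up
            if (j > 0)     A[{k, k - 1}] = -1.0;   // left neighbour
            if (j < n - 1) A[{k, k + 1}] = -1.0;   // right neighbour
        }
}
```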
