Merge: write output to block files #34

amosr · 2016-12-12T07:52:56Z

No description provided.

amosr · 2016-12-12T07:53:05Z

amosr · 2016-12-12T07:53:33Z

csrc/zebra_append.c

@@ -0,0 +1,186 @@
+#include "zebra_append.h"


This is just splitting files, you can ignore it

amosr · 2016-12-12T07:54:45Z

csrc/zebra_append.c

+    return ZEBRA_SUCCESS;
+}
+
+error_t zebra_append_block_entity (anemone_mempool_t *pool, zebra_entity_t *entity, zebra_block_t **inout_block)


This is new code

amosr · 2016-12-12T07:55:57Z

csrc/zebra_block_split.c

@@ -0,0 +1,189 @@
+#include "zebra_block_split.h"


This is just splitting files, you can ignore it

amosr · 2016-12-12T07:56:21Z

csrc/zebra_block_split.c

+
+    in_table->row_count -= n_rows;
+    out_table->row_count = n_rows;
+    out_table->row_capacity = n_rows;


Except for adding this line

amosr · 2016-12-12T07:56:32Z

csrc/zebra_clone.c

@@ -0,0 +1,214 @@
+#include "zebra_clone.h"


This is just splitting files, you can ignore it

amosr · 2016-12-12T07:56:56Z

csrc/zebra_clone.c

+                into_data->d = ZEBRA_CLONE_ARRAY (pool, table_data->d, row_capacity );
+                break;
+            case ZEBRA_ARRAY:
+                into_data->a.n = ZEBRA_CLONE_ARRAY (pool, table_data->a.n, row_capacity );


Except for adding this line

amosr · 2016-12-12T07:57:21Z

csrc/zebra_grow.c

@@ -0,0 +1,105 @@
+#include "zebra_grow.h"


This is just splitting files, you can ignore it

amosr · 2016-12-12T07:57:43Z

csrc/zebra_debug.h

@@ -0,0 +1,102 @@
+#ifndef __ZEBRA_DEBUG_H


This is unused shit, you can ignore it

It's cool if you want to keep this, looks like it might be useful in the future

yeah, I'm pretty sure it'll be useful

jacobstanley · 2016-12-12T21:48:07Z

csrc/zebra_append.c

+        return ZEBRA_MERGE_DIFFERENT_ENTITIES;
+    }
+
+    block->entities = ZEBRA_GROW_ARRAY (pool, block->entities, block->entity_count, block->entity_count + 1);


is it right that we're reallocating here for every entity? perhaps we should track capacity for this array so we can do this more efficiently

yeah - I don't think we'd even need to track capacity since we can compute capacity based on current count. I just wanted the simplest thing possible, and I don't think this is worth spending time on while the read/write and convert is so slow

in a profile I just ran, appendEntityToBlock (which calls this function) has 0% of time. it would be adding extra pressure on the mempool but I think we have bigger issues.
I'd rather just add this as a TODO or issue so we don't forget about it, and move on to other bits

sounds good to me 👍

added issue #35

jacobstanley · 2016-12-12T21:50:06Z

csrc/zebra_append.c

+    block->times = ZEBRA_GROW_ARRAY (pool, block->times, block->row_count, new_row_count);
+    block->priorities = ZEBRA_GROW_ARRAY (pool, block->priorities, block->row_count, new_row_count);
+    block->tombstones = ZEBRA_GROW_ARRAY (pool, block->tombstones, block->row_count, new_row_count);
+    block->row_count = new_row_count;


same deal here, I guess the issue is that zebra_block_t wasn't designed with mutation in mind, but i don't see a problem with bolting on some capacity fields

jacobstanley · 2016-12-12T21:51:06Z

csrc/zebra_append.c

+        block = anemone_mempool_calloc (pool, 1, sizeof (zebra_block_t) );
+    } else if (entity->attribute_count != block->table_count) {
+        // TODO: better error
+        return ZEBRA_MERGE_DIFFERENT_ENTITIES;


Might as well fix this TODO

jacobstanley · 2016-12-12T21:55:41Z

main/zebra.hs

+    outfd <- lift $ IO.openBinaryFile out IO.WriteMode
+    lift $ Builder.hPutBuilder outfd (Serial.bHeader fileheader)
+
+    pool0    <- lift $ Mempool.create


the benefits of executables? 😆

yeah - actually I guess I can free this. I don't have the same problem as #31 since it's in an IORef

jacobstanley · 2016-12-12T21:57:12Z

test/Test/Zebra/Foreign/Block.hs

    firstT fromForeignError . foreignEntitiesOfBlock'

+prop_c_block_of_entities :: Property
+prop_c_block_of_entities =
+  gamble (noShrink jBlock) $ check_block_of_entities


should we remove noShrink now it's working?

jacobstanley · 2016-12-13T00:36:24Z

this is good btw 👍

… check so error is obvious

ambiata-ci assigned jacobstanley and tranma Dec 12, 2016