Update build scripts to always run tests for all MXNet engine types #71

joseph-wakeling-sociomantic · 2018-06-21T14:32:14Z

This patch updates `Build.mak` to ensure that unittests and integration tests will always be run for all of the supported engine types. As well as making it easier to run tests for all engines locally, this will make it possible to greatly simplify our CI setup.

joseph-wakeling-sociomantic · 2018-06-21T14:32:52Z

I'm going to follow up with a separate travis config patch. For now I want to see the behavioural difference without it ;-)

Since `Build.mak` now takes care of running tests with all engines, we can greatly simplify the number and configuration of Travis CI jobs. This patch reduces the setup to cover development and production builds for D1 and D2 respectively.

joseph-wakeling-sociomantic · 2018-06-21T14:48:28Z

Build.mak

+$O/test-mxnet.stamp: $O/test-mxnet download-mnist
+	$(call run_test,$<,NaiveEngine)
+	$(call run_test,$<,ThreadedEngine)
+	$(call run_test,$<,ThreadedEnginePerDevice)


It occurs to me that an even better option would be to have a variable defining the supported engines (set in Config.mak if not already defined) and just foreach over the entries in that variable. But I'm finding it tricky to work out how best to do that :-\

Specifically: how to implement an appropriate for loop. Although I guess a simple, manual way is to have a separate run_test_with_all_engines function which takes a single parameter (the test to run).

I've tried with:

run_test_with_all_engines = $(foreach engine,$(SUPPORTED_MXNET_ENGINES),$(call run_test,$1,$(engine)))

... but for some reason, this only ever picks up on the first entry in the SUPPORTED_MXNET_ENGINES variable.
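One possible explanation (an assumption, since `run_test` itself is not shown in this thread): `$(foreach ...)` joins its results with spaces, so inside a recipe all the generated commands land on a single shell line and the later ones are passed as arguments to the first. A minimal sketch of a workaround follows, using a hypothetical stand-in for `run_test`, a placeholder `TEST_BINARY`, and a variable holding a literal newline so that each engine gets its own recipe line:

```make
SUPPORTED_MXNET_ENGINES ?= NaiveEngine ThreadedEngine ThreadedEnginePerDevice

# Placeholder path, for illustration only.
TEST_BINARY ?= ./test-mxnet

# Hypothetical stand-in for the real run_test helper (not shown in this thread):
# $1 = test binary, $2 = engine name.
run_test = MXNET_ENGINE_TYPE=$2 $1

# A variable whose value is exactly one newline (two blank lines inside define).
define newline


endef

# End each generated command with a newline so that make runs them as
# separate recipe lines instead of one long shell command.
run_test_with_all_engines = $(foreach engine,$(SUPPORTED_MXNET_ENGINES),$(call run_test,$1,$(engine))$(newline))

.PHONY: check
check:
	$(call run_test_with_all_engines,$(TEST_BINARY))
```

Because each engine becomes its own recipe line, a failure with one engine stops the run at that point, as with the three explicit `run_test` calls in the diff above.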

BTW, the dependency of $O/test-mxnet is added automatically by Makd, you don't need to put it explicitly.

This is what I assumed, but it didn't work without it. :-\

joseph-wakeling-sociomantic · 2018-06-21T14:51:39Z

I've pushed a follow-up patch to simplify the Travis config.

leandro-lucarella-sociomantic

LGTM, although I suggest to still provide the option to run one individual test, in case only one fails, so you don't have to repeat all of them.

leandro-lucarella-sociomantic · 2018-06-21T15:05:02Z

(I would also squash)

joseph-wakeling-sociomantic · 2018-06-21T15:07:54Z

> LGTM, although I suggest to still provide the option to run one individual test, in case only one fails, so you don't have to repeat all of them.

Agreed -- it's what I was trying to do by defining a variable that lists the engines to test. Do you have any suggestions for how to do the kind of foreach loop I had in mind above?

joseph-wakeling-sociomantic · 2018-06-21T17:18:57Z

I've pushed a squash! patch that I think gets things working w.r.t. allowing the engines to be specified via an environment variable. @mihails-strasuns-sociomantic @leandro-lucarella-sociomantic what do you think? I suspect it's probably possible to simplify.

joseph-wakeling-sociomantic · 2018-06-21T17:21:10Z

... and one more minor tweak.

joseph-wakeling-sociomantic · 2018-06-21T17:22:22Z

Build.mak

 # extra runtime dependencies for integration tests
 $O/test-mxnet.stamp: override ITFLAGS += $(MNIST_DATA_DIR)
 $O/test-mxnet.stamp: download-mnist
+	$Vtouch $@ # override default implementation


This tweak (and the similar one for unittests) is necessary to prevent the "normal" integration test run from also taking place. Would be nice to avoid if possible though.

Another approach is to have the MXNet default engine always executed and then change TEST_MXNET_ENGINES to mean additional engines to test.

Yes. It would probably be best in that case to remove the MXNET_ENGINE_TYPE ?= line from Config.mak as then we test the default case where the engine is not specified in the environment.

That would also avoid the need to override the default environment with lines like

$O/allunittests.stamp:
	$Vtouch $@

... so it's probably better this way.

HOWEVER, note that this then messes up things from another point of view, which is: what if I want to specify just one engine to test with, not as an extra?

Obviously I can do `MXNET_ENGINE_TYPE=MyEngine TEST_MXNET_ENGINES= make test all` but this is getting finicky. That's why from a CI perspective I'd rather have just one variable (the build script's variable) be the one the user has to care about.

joseph-wakeling-sociomantic · 2018-06-21T17:27:32Z

Build.mak

 # extra build dependencies for integration tests
 $O/test-mxnet: override LDFLAGS += -lz
 $O/test-mxnet: override DFLAGS += -debug=MXNetHandleManualFree

+# run integration tests with all specified engines
+$(eval $(call test_with_engines,$O/test-mxnet,$(TEST_MXNET_ENGINES)))


It'd be nice to be able to simplify these lines.

joseph-wakeling-sociomantic · 2018-06-21T17:27:51Z

Build.mak

+endef
+
+# helper function to generate targets for per-engine test runs
+test_with_engines = $(foreach engine,$2,\


Probably here we can drop $2 and just use $(TEST_MXNET_ENGINES) directly.

joseph-wakeling-sociomantic · 2018-06-21T17:32:02Z

Build.mak

 # extra build dependencies for integration tests
 $O/test-mxnet: override LDFLAGS += -lz
 $O/test-mxnet: override DFLAGS += -debug=MXNetHandleManualFree

+# run integration tests with all specified engines
+$(eval $(call test_with_engines,$O/test-mxnet))


Is there any way to avoid the $(eval $(call ... here? They both seem to be necessary, otherwise I get a missing separator error.

I don't think so. call is needed to run the function and eval is needed to evaluate the returned string describing the target (https://www.gnu.org/software/make/manual/html_node/Eval-Function.html#Eval-Function).
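To illustrate the division of labour described above: `$(call ...)` only substitutes arguments and returns text, while `$(eval ...)` hands that text back to make's parser so that it is read as rules. A minimal sketch, with a hypothetical `per_engine_rule` template and made-up `check-*` target names (none of these are from the actual patch):

```make
# Declared first so that "check" is the default goal.
.PHONY: check
check:

# Template: $1 = engine name. On its own, $(call ...) merely returns this text.
define per_engine_rule
.PHONY: check-$1
check-$1:
	@echo "would run tests with MXNET_ENGINE_TYPE=$1"

check: check-$1
endef

# $(call ...) fills in $1; $(eval ...) parses the result as makefile syntax.
$(eval $(call per_engine_rule,NaiveEngine))
$(eval $(call per_engine_rule,ThreadedEngine))
```

Dropping the `$(eval ...)` wrapper leaves a multi-line expansion sitting in a single makefile line, which is the situation in which the missing separator error was reported above.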

joseph-wakeling-sociomantic · 2018-06-21T17:35:41Z

Config.mak

@@ -1,5 +1,7 @@
 INTEGRATIONTEST := integrationtest

+TEST_MXNET_ENGINES ?= NaiveEngine ThreadedEngine ThreadedEnginePerDevice


One small note here: it should not be possible for the TEST_MXNET_ENGINES variable to be empty. If I do:

TEST_MXNET_ENGINES= make test all

... then obviously, nothing runs. But there should be a failure in this case (which doesn't happen right now). Any thoughts on the best way to implement this?

I don't see why there should be a failure: with an empty TEST_MXNET_ENGINES you define no additional targets.

@jens-mueller-sociomantic right now they are not ADDITIONAL targets. Of course, we could make it so. It just means that NaiveEngine tests will run twice. But if we kill the MXNET_ENGINE_TYPE ?= line in Config.mak, maybe that makes sense (as we then test the case where MXNET_ENGINE_TYPE is not specified in the environment).

Perhaps define the TEST_MXNET_ENGINES variable to some "always fails" phony target by default, requiring that user override it.

@nemanja-boric-sociomantic I'm not sure it really fixes the issue, because the problem is precisely if the user overrides the variable to set it to empty (a phony target won't solve that).

Right, I was talking nonsense. Hm...

There's a way of defining the default to be NaiveEngine and filtering it out from the TEST_MXNET_ENGINES.

TBH I think this is overthinking things. If someone sets the field to empty that's their problem ...
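If a hard failure on an empty list were wanted after all, one option is a parse-time guard. A minimal sketch, assuming the check would sit next to the default in Config.mak (the error message wording is made up for illustration):

```make
TEST_MXNET_ENGINES ?= NaiveEngine ThreadedEngine ThreadedEnginePerDevice

# ?= leaves alone a variable that is set but empty (for example from the
# environment), so that case has to be checked for explicitly.
ifeq ($(strip $(TEST_MXNET_ENGINES)),)
$(error TEST_MXNET_ENGINES is empty; list at least one MXNet engine type)
endif
```

Note that such a guard fires while the makefile is being read, so it would also abort goals that run no tests at all, which may be more strictness than is wanted here.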

joseph-wakeling-sociomantic · 2018-06-21T17:37:57Z

Build.mak

+
+# helper function to generate targets for per-engine test runs
+test_with_engines = $(foreach engine,$(TEST_MXNET_ENGINES),\
+	$(eval $(call run_test_with_engine,$1,$(engine))))


Both eval and call seem necessary here too to avoid a missing separator error.

jens-mueller-sociomantic · 2018-06-22T07:31:23Z

Build.mak

+$(1).stamp: $(1)-$(2).stamp
+
+$(1)-$(2).stamp: $1
+	$(call exec,MXNET_ENGINE_TYPE=$2 $1,$1,$2)


Aren't you missing a `$Vtouch $(1)-$(2).stamp` here?

Yes, most likely. I think I tried using $@ and there was some objection to that (which I assume is about ordering of make's expansion of these tokens).
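On the `$@` problem: inside a `define` that goes through `$(eval $(call ...))`, a single `$@` is expanded while the template is being instantiated, when there is no current target yet, so it comes out empty. Doubling the dollar sign defers the expansion until the recipe runs. A simplified sketch of the template under discussion, with plain shell commands standing in for makd's `exec` and `$V` helpers and a placeholder binary path:

```make
# $1 = test binary, $2 = engine name.
# $$< and $$@ survive the eval pass as $< and $@, and are expanded per target
# when the recipe is actually executed.
define run_test_with_engine
$1.stamp: $1-$2.stamp

$1-$2.stamp: $1
	MXNET_ENGINE_TYPE=$2 $$<
	touch $$@
endef

# Example instantiation; build/test-mxnet is a placeholder path.
$(eval $(call run_test_with_engine,build/test-mxnet,NaiveEngine))
```

In this sketch the aggregate `$1.stamp` target has no recipe of its own; in the real build that part is left to the existing rules.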

joseph-wakeling-sociomantic · 2018-06-22T09:51:00Z

Build.mak


 $O/%unittests: override LDFLAGS += -lz
+
+# run unittests with all specified engines
+$(eval $(call test_with_engines,$O/allunittests))


Minor note: would it be worth killing the need to write $O here? i.e. have the calls be

$(eval $(call test_with_engines,allunittests))
$(eval $(call test_with_engines,test-mxnet))

... and do the $O stuff under the hood ... ?

… types This should get the dependencies working OK.

joseph-wakeling-sociomantic · 2018-06-22T15:05:01Z

I've pushed a further fixup patch. @leandro-lucarella-sociomantic @nemanja-boric-sociomantic you may want to review this solution ;-)

joseph-wakeling-sociomantic · 2018-06-22T15:09:16Z

Build.mak

-$O/test-mxnet.stamp: override ITFLAGS += $(MNIST_DATA_DIR)
-$O/test-mxnet.stamp: download-mnist
+$(eval $(call run_test_with_dependency,$O/test-mxnet,\
+	override ITFLAGS += $(MNIST_DATA_DIR)))


I'm not convinced that this is working. The output of make -p gives e.g.:

build/devel/tmp/test-mxnet-NaiveEngine.stamp: ITFLAGS += ~/data/mnist

(where ~ is expanded;-) but the integration tests (and download script) both use the default value of this parameter, despite it being set in my Config.local.mak.

That said, this might be something other than how this Build.mak is written.

I've just validated with current v0.x.x, where things are working w.r.t. running test-mxnet, but not in terms of where the download script places things (which appears to be an existing Build.mak bug). So there seems to be some problem with how this override is being handled.

It looks like it's because the custom rule implemented in test_with_engine is missing the details used in the default makd integration test run:

# General rule to Run the test suite binaries
$O/test-%.stamp: $O/test-%
	$(call exec,$< $(_test_drt_opts) $(ITFLAGS),$<,run)
	$Vtouch $@

Now, I can easily add $$(_test_drt_opts) $$(ITFLAGS) to the call implemented here in Build.mak, but note that this is supposed to work for unittests as well as integration tests and there are no similar settings in vanilla makd where unittests are concerned.

@leandro-lucarella-sociomantic @nemanja-boric-sociomantic any advice/thoughts?

_test_drt_opts are common, I think, but unittests versus integration-tests should use UTFLAGS versus ITFLAGS respectively. I guess I could define a function to work out (based on the full target name) which options should be used. But it really feels like this is starting to turn into a nasty code duplication between how this Build.mak is doing things, versus how makd is doing things.
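The "work out the flags from the target name" idea could look roughly like the following. This is only a sketch: `test_flags` is a hypothetical helper name, the flag values are placeholders, and the only assumption about naming is that unittest binaries end in `unittests`, as `allunittests` does above.

```make
# Placeholder values; in the real build these come from makd or Config.mak.
UTFLAGS ?= --ut-placeholder
ITFLAGS ?= --it-placeholder

# Hypothetical helper: $1 = full path of the test binary.
# Binaries whose file name ends in "unittests" get UTFLAGS, all others ITFLAGS.
test_flags = $(if $(filter %unittests,$(notdir $1)),$(UTFLAGS),$(ITFLAGS))

# Print the selection while the makefile is being read.
$(info allunittests -> $(call test_flags,build/devel/tmp/allunittests))
$(info test-mxnet   -> $(call test_flags,build/devel/tmp/test-mxnet))

.PHONY: all
all: ;
```

Since `ITFLAGS` is extended per target in this Build.mak, a template passed through `$(eval ...)` would need to write the call as `$$(call test_flags,$1)` so that the flags are looked up when the recipe runs rather than when the template is instantiated.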

TBH, my feeling is you have reached a level of complexity where we should consider supporting this as a makd feature. @leandro-lucarella-sociomantic

Yes, I agree, this looks like it needs makd support.

The question is what level of makd support, though. It's not clear to me that it needs to be at the level of actually supporting the feature needed here (i.e. running tests multiple times for different environment settings). It might suffice to provide make functions to run the tests associated with a particular .stamp target (so that one can create a custom recipe without having to duplicate everything makd already does).

@leandro-lucarella-sociomantic any thoughts? (Including "Drop this for now and get on with more actionable items" :-P)

@leandro-lucarella-sociomantic your input is needed here :-)

joseph-wakeling-sociomantic · 2018-06-22T15:10:19Z

Build.mak

+	download-mnist))
+
+# run integration tests with all specified engines
+$(eval $(call run_test_with_engines,$O/test-mxnet))


A weird behaviour: right now make test always re-runs the integration tests, despite the associated .stamp files being correctly generated. Any idea what could be responsible?

OK, it looks like the download-mnist phony target is responsible, since there is no associated .stamp. It's independent of this PR, just more noticeable.
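One way to stop a phony prerequisite from forcing re-runs is to make it order-only, so that it still runs first but never makes the stamp look out of date. A minimal sketch with placeholder names and recipes (not the project's actual rules); how this interacts with makd's own pattern rules is untested:

```make
.PHONY: download-mnist
download-mnist:
	@echo "fetch MNIST data here (placeholder)"

# Prerequisites after the | are order-only: they are brought up to date before
# the target, but they never cause the target itself to be rebuilt.
test-mxnet.stamp: test-mxnet | download-mnist
	./test-mxnet
	touch $@
```

The phony recipe itself still executes on every invocation, so the download step would need to stay cheap when the data is already present.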

joseph-wakeling-sociomantic · 2018-06-22T15:10:47Z

Build.mak


 $O/%unittests: override LDFLAGS += -lz
+
+# run unittests with all specified engines
+$(eval $(call run_test_with_engines,$O/allunittests))


OTOH this works just fine, with no re-running of unittests after they have been successfully run once.

joseph-wakeling-sociomantic · 2018-06-22T16:08:48Z

Considering how many successive issues we're uncovering, I'm starting to question whether this PR is worth the complications it introduces.

I wonder whether it might be worth trying to distill the essence of what is wanted here (i.e. ability to replay both unittests and integration tests with multiple different runtime environment settings), and implement support directly in makd, to ensure no conflicts or duplication of effort.

joseph-wakeling-sociomantic · 2018-06-26T10:05:58Z

Removing the milestone and lowering the priority, as I think we should take a bit of a step back from this: so far it's proving more complicated than it's worth for the benefit. Would still be interested in addressing the slightly bigger picture feature properly at some point, though.

leandro-lucarella-sociomantic · 2018-06-26T11:22:57Z

> I've pushed a further fixup patch. @leandro-lucarella-sociomantic @nemanja-boric-sociomantic you may want to review this solution ;-)

I saw @nemanja-boric-sociomantic already reviewed this, so I will skip it then. If you still need my input, please mention me again!

joseph-wakeling-sociomantic · 2018-06-26T11:28:27Z

@leandro-lucarella-sociomantic it would be good to have your input on the outcome of our discussions, in particular w.r.t. our discussion point here: #71 (comment)

... but I don't think you need to approach this with any urgency. I can also file an upstream makd issue distilling the requirements (which could be addressed either with a concrete how-to-do-it-downstream solution, or by makd features, or a bit of both, depending on what's required).

leandro-lucarella-sociomantic · 2018-06-26T12:41:54Z

To be honest the discussion is too long and it would take a lot of time to go through it, so if it is an option to write a more generic and summarized issue in makd, that would be the best option IMHO (if appropriate, i.e. if there is anything wrong with it or a new feature is needed).

joseph-wakeling-sociomantic added the type-ci label Jun 21, 2018

joseph-wakeling-sociomantic added this to the v0.4.0 milestone Jun 21, 2018

joseph-wakeling-sociomantic requested review from mihails-strasuns-sociomantic, jens-mueller-sociomantic and leandro-lucarella-sociomantic June 21, 2018 14:33

leandro-lucarella-sociomantic previously approved these changes Jun 21, 2018

squash! Update build scripts to always run tests for all MXNet engine…

201fe10

… types

joseph-wakeling-sociomantic dismissed leandro-lucarella-sociomantic’s stale review via 201fe10 June 21, 2018 17:17

squash! Update build scripts to always run tests for all MXNet engine…

9370ffd

… types

squash! Update build scripts to always run tests for all MXNet engine…

38c3e08

… types

jens-mueller-sociomantic reviewed Jun 22, 2018

squash! Update build scripts to always run tests for all MXNet engine…

c8d44c2

… types This should get the dependencies working OK.

joseph-wakeling-sociomantic mentioned this pull request Jun 25, 2018

download-mnist script does not pick up on custom MNIST_DATA_DIR #74

Closed

joseph-wakeling-sociomantic added the prio-low label Jun 26, 2018

joseph-wakeling-sociomantic removed this from the v0.4.0 milestone Jun 26, 2018
