Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

digits Slurm #1435

Open
wants to merge 106 commits into
base: master
Choose a base branch
from
Open

digits Slurm #1435

wants to merge 106 commits into from

Commits on Nov 28, 2016

  1. init

    Wolf Zimmermann committed Nov 28, 2016
    Configuration menu
    Copy the full SHA
    50e8a8a View commit details
    Browse the repository at this point in the history
  2. basic slurm tasks and detection

    Wolf Zimmermann committed Nov 28, 2016
    Configuration menu
    Copy the full SHA
    9a49297 View commit details
    Browse the repository at this point in the history
  3. adding new gitingnore

    Wolf Zimmermann committed Nov 28, 2016
    Configuration menu
    Copy the full SHA
    b6eb54e View commit details
    Browse the repository at this point in the history
  4. saving before branch

    Wolf Zimmermann committed Nov 28, 2016
    Configuration menu
    Copy the full SHA
    0be0458 View commit details
    Browse the repository at this point in the history

Commits on Nov 29, 2016

  1. overide gpu check

    Wolf Zimmermann committed Nov 29, 2016
    Configuration menu
    Copy the full SHA
    5b1c1c5 View commit details
    Browse the repository at this point in the history
  2. minor change to gpu selector

    Wolf Zimmermann committed Nov 29, 2016
    Configuration menu
    Copy the full SHA
    fb1df8d View commit details
    Browse the repository at this point in the history
  3. line ending issues

    Wolf Zimmermann committed Nov 29, 2016
    Configuration menu
    Copy the full SHA
    7f57546 View commit details
    Browse the repository at this point in the history

Commits on Nov 30, 2016

  1. Work around for inference

    Wolf Zimmermann committed Nov 30, 2016
    Configuration menu
    Copy the full SHA
    7ea2a02 View commit details
    Browse the repository at this point in the history

Commits on Dec 1, 2016

  1. Configuration menu
    Copy the full SHA
    0d8185a View commit details
    Browse the repository at this point in the history
  2. fixed gpu selector to only show when not in slurm mode

    Wolf Zimmermann committed Dec 1, 2016
    Configuration menu
    Copy the full SHA
    3acd953 View commit details
    Browse the repository at this point in the history
  3. removed junk

    Wolf Zimmermann committed Dec 1, 2016
    Configuration menu
    Copy the full SHA
    95fa40a View commit details
    Browse the repository at this point in the history
  4. fixed line endings

    Wolf Zimmermann committed Dec 1, 2016
    Configuration menu
    Copy the full SHA
    c47fde8 View commit details
    Browse the repository at this point in the history
  5. fixing build lint

    Wolf Zimmermann committed Dec 1, 2016
    Configuration menu
    Copy the full SHA
    830da0b View commit details
    Browse the repository at this point in the history
  6. formatting

    Wolf Zimmermann committed Dec 1, 2016
    Configuration menu
    Copy the full SHA
    2d8af6d View commit details
    Browse the repository at this point in the history
  7. more build fixes

    Wolf Zimmermann committed Dec 1, 2016
    Configuration menu
    Copy the full SHA
    e0c0353 View commit details
    Browse the repository at this point in the history
  8. fixing lint issues

    Wolf Zimmermann committed Dec 1, 2016
    Configuration menu
    Copy the full SHA
    f882924 View commit details
    Browse the repository at this point in the history
  9. check build test

    Wolf Zimmermann committed Dec 1, 2016
    Configuration menu
    Copy the full SHA
    b149331 View commit details
    Browse the repository at this point in the history

Commits on Dec 2, 2016

  1. changed test file back

    Wolf Zimmermann committed Dec 2, 2016
    Configuration menu
    Copy the full SHA
    27b3b8c View commit details
    Browse the repository at this point in the history
  2. merged with nvidia master

    Wolf Zimmermann committed Dec 2, 2016
    Configuration menu
    Copy the full SHA
    bfc3fe5 View commit details
    Browse the repository at this point in the history
  3. added check box for slurm

    Wolf Zimmermann committed Dec 2, 2016
    Configuration menu
    Copy the full SHA
    d4f4040 View commit details
    Browse the repository at this point in the history

Commits on Dec 5, 2016

  1. slurm flags

    Wolf Zimmermann committed Dec 5, 2016
    Configuration menu
    Copy the full SHA
    ce59562 View commit details
    Browse the repository at this point in the history
  2. check

    Wolf Zimmermann committed Dec 5, 2016
    Configuration menu
    Copy the full SHA
    dc90667 View commit details
    Browse the repository at this point in the history
  3. removed task debug code

    Wolf Zimmermann committed Dec 5, 2016
    Configuration menu
    Copy the full SHA
    f222935 View commit details
    Browse the repository at this point in the history
  4. fix lint

    Wolf Zimmermann committed Dec 5, 2016
    Configuration menu
    Copy the full SHA
    0339605 View commit details
    Browse the repository at this point in the history
  5. lint

    Wolf Zimmermann committed Dec 5, 2016
    Configuration menu
    Copy the full SHA
    99d1d47 View commit details
    Browse the repository at this point in the history
  6. lint

    Wolf Zimmermann committed Dec 5, 2016
    Configuration menu
    Copy the full SHA
    7afc30f View commit details
    Browse the repository at this point in the history
  7. lint

    Wolf Zimmermann committed Dec 5, 2016
    Configuration menu
    Copy the full SHA
    684a94e View commit details
    Browse the repository at this point in the history
  8. build.sh

    Wolf Zimmermann committed Dec 5, 2016
    Configuration menu
    Copy the full SHA
    6d41dd3 View commit details
    Browse the repository at this point in the history

Commits on Dec 6, 2016

  1. added gui changed for slurm db tasks

    Wolf Zimmermann committed Dec 6, 2016
    Configuration menu
    Copy the full SHA
    55b414b View commit details
    Browse the repository at this point in the history
  2. Merge remote-tracking branch 'upstream/master' into dev

    merge with upstream master
    Wolf Zimmermann committed Dec 6, 2016
    Configuration menu
    Copy the full SHA
    b53b3c0 View commit details
    Browse the repository at this point in the history
  3. removed old jobs

    Wolf Zimmermann committed Dec 6, 2016
    Configuration menu
    Copy the full SHA
    7f82268 View commit details
    Browse the repository at this point in the history
  4. test

    Wolf Zimmermann committed Dec 6, 2016
    Configuration menu
    Copy the full SHA
    299f487 View commit details
    Browse the repository at this point in the history
  5. Minor lint fix

    Wolf Zimmermann committed Dec 6, 2016
    Configuration menu
    Copy the full SHA
    facde02 View commit details
    Browse the repository at this point in the history
  6. job numbers in gui

    Wolf Zimmermann committed Dec 6, 2016
    Configuration menu
    Copy the full SHA
    bb8c1ee View commit details
    Browse the repository at this point in the history
  7. build fix

    Wolf Zimmermann committed Dec 6, 2016
    Configuration menu
    Copy the full SHA
    26a99a9 View commit details
    Browse the repository at this point in the history
  8. fix build

    Wolf Zimmermann committed Dec 6, 2016
    Configuration menu
    Copy the full SHA
    dec9bde View commit details
    Browse the repository at this point in the history
  9. fixed setup test

    Wolf Zimmermann committed Dec 6, 2016
    Configuration menu
    Copy the full SHA
    fc82aff View commit details
    Browse the repository at this point in the history
  10. test

    Wolf Zimmermann committed Dec 6, 2016
    Configuration menu
    Copy the full SHA
    36561b1 View commit details
    Browse the repository at this point in the history

Commits on Dec 7, 2016

  1. set db tasks back to int mode

    Wolf Zimmermann committed Dec 7, 2016
    Configuration menu
    Copy the full SHA
    2b9f63c View commit details
    Browse the repository at this point in the history
  2. Fix s_mem not being popped

    Wolf Zimmermann committed Dec 7, 2016
    Configuration menu
    Copy the full SHA
    ee7eff4 View commit details
    Browse the repository at this point in the history
  3. enabled slurm db tasks

    Wolf Zimmermann committed Dec 7, 2016
    Configuration menu
    Copy the full SHA
    7d2a599 View commit details
    Browse the repository at this point in the history
  4. exceptions for jobs

    Wolf Zimmermann committed Dec 7, 2016
    Configuration menu
    Copy the full SHA
    8d6af9b View commit details
    Browse the repository at this point in the history
  5. testing

    Wolf Zimmermann committed Dec 7, 2016
    Configuration menu
    Copy the full SHA
    2b0c3f1 View commit details
    Browse the repository at this point in the history
  6. jenkins build

    Wolf Zimmermann committed Dec 7, 2016
    Configuration menu
    Copy the full SHA
    29ff26f View commit details
    Browse the repository at this point in the history
  7. Debug print statements for jenkins

    Wolf Zimmermann committed Dec 7, 2016
    Configuration menu
    Copy the full SHA
    b470ee8 View commit details
    Browse the repository at this point in the history
  8. more debug

    Wolf Zimmermann committed Dec 7, 2016
    Configuration menu
    Copy the full SHA
    e102d79 View commit details
    Browse the repository at this point in the history
  9. Debug task jenkinds

    Wolf Zimmermann committed Dec 7, 2016
    Configuration menu
    Copy the full SHA
    b55b284 View commit details
    Browse the repository at this point in the history

Commits on Dec 8, 2016

  1. Testing digits.dataset.tasks.create_generic_db.CreateGenericDbTask ex…

    …cluded from slurm
    Wolf Zimmermann committed Dec 8, 2016
    Configuration menu
    Copy the full SHA
    966abdd View commit details
    Browse the repository at this point in the history
  2. tidy of task - issue of db tasks not working on slurm is still on goi…

    …ng have excluded for now
    Wolf Zimmermann committed Dec 8, 2016
    Configuration menu
    Copy the full SHA
    34d72be View commit details
    Browse the repository at this point in the history
  3. Inference excluded to make DB errors clear

    Wolf Zimmermann committed Dec 8, 2016
    Configuration menu
    Copy the full SHA
    eb8fcf4 View commit details
    Browse the repository at this point in the history

Commits on Dec 9, 2016

  1. set slurm tmp dir - this fixes the slurm chdir errors as the envar TM…

    …PDIR points to node local storage
    Wolf Zimmermann committed Dec 9, 2016
    Configuration menu
    Copy the full SHA
    fbc1760 View commit details
    Browse the repository at this point in the history
  2. S1

    Wolf Zimmermann committed Dec 9, 2016
    Configuration menu
    Copy the full SHA
    191c710 View commit details
    Browse the repository at this point in the history

Commits on Dec 11, 2016

  1. Fixed caffe slurm timeout errors

    Wolf Zimmermann committed Dec 11, 2016
    Configuration menu
    Copy the full SHA
    c12b939 View commit details
    Browse the repository at this point in the history
  2. re-enabled slurm

    Wolf Zimmermann committed Dec 11, 2016
    Configuration menu
    Copy the full SHA
    9a9bec4 View commit details
    Browse the repository at this point in the history

Commits on Dec 12, 2016

  1. changed setting for slurm

    Wolf Zimmermann committed Dec 12, 2016
    Configuration menu
    Copy the full SHA
    0252800 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4331613 View commit details
    Browse the repository at this point in the history

Commits on Dec 13, 2016

  1. end of day

    Wolf Zimmermann committed Dec 13, 2016
    Configuration menu
    Copy the full SHA
    ec22501 View commit details
    Browse the repository at this point in the history

Commits on Dec 15, 2016

  1. torch works

    Wolf Zimmermann committed Dec 15, 2016
    Configuration menu
    Copy the full SHA
    a1e3f1f View commit details
    Browse the repository at this point in the history
  2. Changed timeouts

    Wolf Zimmermann committed Dec 15, 2016
    Configuration menu
    Copy the full SHA
    53a2137 View commit details
    Browse the repository at this point in the history
  3. time limit form fix

    Wolf Zimmermann committed Dec 15, 2016
    Configuration menu
    Copy the full SHA
    a108a01 View commit details
    Browse the repository at this point in the history

Commits on Dec 16, 2016

  1. Fixed up inference

    Wolf Zimmermann committed Dec 16, 2016
    Configuration menu
    Copy the full SHA
    8ecadf7 View commit details
    Browse the repository at this point in the history
  2. .gitignore is now working

    Wolf Zimmermann committed Dec 16, 2016
    Configuration menu
    Copy the full SHA
    2292144 View commit details
    Browse the repository at this point in the history
  3. .gitignore is now working

    Wolf Zimmermann committed Dec 16, 2016
    Configuration menu
    Copy the full SHA
    7e16d53 View commit details
    Browse the repository at this point in the history
  4. .gitignore is now working

    Wolf Zimmermann committed Dec 16, 2016
    Configuration menu
    Copy the full SHA
    a7c96e1 View commit details
    Browse the repository at this point in the history

Commits on Dec 18, 2016

  1. removed redundant code from generic.veiws.py

    Wolf Zimmermann committed Dec 18, 2016
    Configuration menu
    Copy the full SHA
    2a1facb View commit details
    Browse the repository at this point in the history

Commits on Dec 19, 2016

  1. Revert "Changed timeouts"

    This reverts commit 53a2137.
    
    Tracking down errors with this commit
    Wolf Zimmermann committed Dec 19, 2016
    Configuration menu
    Copy the full SHA
    d75d246 View commit details
    Browse the repository at this point in the history
  2. lint fix

    Wolf Zimmermann committed Dec 19, 2016
    Configuration menu
    Copy the full SHA
    ec5932c View commit details
    Browse the repository at this point in the history

Commits on Dec 21, 2016

  1. Refactored cluster management into classes

    Wolf Zimmermann committed Dec 21, 2016
    Configuration menu
    Copy the full SHA
    480e449 View commit details
    Browse the repository at this point in the history
  2. Refactored cluster management into classes

    Wolf Zimmermann committed Dec 21, 2016
    Configuration menu
    Copy the full SHA
    bbc8004 View commit details
    Browse the repository at this point in the history
  3. fixing up system types

    Wolf Zimmermann committed Dec 21, 2016
    Configuration menu
    Copy the full SHA
    7b2928c View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    0b84511 View commit details
    Browse the repository at this point in the history
  5. make jenkins set slurm

    Wolf Zimmermann committed Dec 21, 2016
    Configuration menu
    Copy the full SHA
    122a3cc View commit details
    Browse the repository at this point in the history

Commits on Dec 22, 2016

  1. fix for node local tmpdir

    Wolf Zimmermann committed Dec 22, 2016
    Configuration menu
    Copy the full SHA
    cdbb196 View commit details
    Browse the repository at this point in the history
  2. inference working

    Wolf Zimmermann committed Dec 22, 2016
    Configuration menu
    Copy the full SHA
    e63aa64 View commit details
    Browse the repository at this point in the history
  3. testing jenkins issues

    Wolf Zimmermann committed Dec 22, 2016
    Configuration menu
    Copy the full SHA
    d1c9c9b View commit details
    Browse the repository at this point in the history
  4. more debug

    Wolf Zimmermann committed Dec 22, 2016
    Configuration menu
    Copy the full SHA
    11fe393 View commit details
    Browse the repository at this point in the history
  5. cast gpu for caffe to int

    Wolf Zimmermann committed Dec 22, 2016
    Configuration menu
    Copy the full SHA
    1a835b2 View commit details
    Browse the repository at this point in the history
  6. fixed gpu --gpu=all

    Wolf Zimmermann committed Dec 22, 2016
    Configuration menu
    Copy the full SHA
    46864a1 View commit details
    Browse the repository at this point in the history
  7. trying to fix cuda errors

    Wolf Zimmermann committed Dec 22, 2016
    Configuration menu
    Copy the full SHA
    c69ff3b View commit details
    Browse the repository at this point in the history

Commits on Dec 23, 2016

  1. fixed gpu issues

    Wolf Zimmermann committed Dec 23, 2016
    Configuration menu
    Copy the full SHA
    8bd5d0e View commit details
    Browse the repository at this point in the history

Commits on Jan 2, 2017

  1. inf testing

    Wolf Zimmermann committed Jan 2, 2017
    Configuration menu
    Copy the full SHA
    3cfe233 View commit details
    Browse the repository at this point in the history
  2. inf change gpu to id 0

    Wolf Zimmermann committed Jan 2, 2017
    Configuration menu
    Copy the full SHA
    da6b9f2 View commit details
    Browse the repository at this point in the history

Commits on Jan 3, 2017

  1. fixed gpu selection for generic tasks

    Wolf Zimmermann committed Jan 3, 2017
    Configuration menu
    Copy the full SHA
    a6d8c58 View commit details
    Browse the repository at this point in the history

Commits on Jan 4, 2017

  1. gpu fix

    Wolf Zimmermann committed Jan 4, 2017
    Configuration menu
    Copy the full SHA
    f9358c2 View commit details
    Browse the repository at this point in the history
  2. Got changes from login branch

    Wolf Zimmermann committed Jan 4, 2017
    Configuration menu
    Copy the full SHA
    b1eaf3c View commit details
    Browse the repository at this point in the history
  3. testing

    Wolf Zimmermann committed Jan 4, 2017
    Configuration menu
    Copy the full SHA
    0adf134 View commit details
    Browse the repository at this point in the history

Commits on Jan 10, 2017

  1. Fixed dataset pages

    Wolf Zimmermann committed Jan 10, 2017
    Configuration menu
    Copy the full SHA
    a062372 View commit details
    Browse the repository at this point in the history
  2. added check for cudaDeviceGetPCIBusId()

    Wolf Zimmermann committed Jan 10, 2017
    Configuration menu
    Copy the full SHA
    27a0a66 View commit details
    Browse the repository at this point in the history

Commits on Jan 11, 2017

  1. gpu debugging

    Wolf Zimmermann committed Jan 11, 2017
    Configuration menu
    Copy the full SHA
    6efb6d2 View commit details
    Browse the repository at this point in the history

Commits on Jan 12, 2017

  1. fixed digits job number

    Wolf Zimmermann committed Jan 12, 2017
    Configuration menu
    Copy the full SHA
    6e76413 View commit details
    Browse the repository at this point in the history

Commits on Jan 16, 2017

  1. Updated tasks default and cluster management layout

    Wolf Zimmermann committed Jan 16, 2017
    Configuration menu
    Copy the full SHA
    d5e8210 View commit details
    Browse the repository at this point in the history
  2. Fixed cluster manager

    Wolf Zimmermann committed Jan 16, 2017
    Configuration menu
    Copy the full SHA
    5b950d6 View commit details
    Browse the repository at this point in the history
  3. refactor job cancel into cluster manager class

    Wolf Zimmermann committed Jan 16, 2017
    Configuration menu
    Copy the full SHA
    b212d64 View commit details
    Browse the repository at this point in the history
  4. removed prints

    Wolf Zimmermann committed Jan 16, 2017
    Configuration menu
    Copy the full SHA
    989b9b5 View commit details
    Browse the repository at this point in the history

Commits on Jan 17, 2017

  1. Merge remote-tracking branch 'upstream/master' into dev

    Wolf Zimmermann committed Jan 17, 2017
    Configuration menu
    Copy the full SHA
    b7bc101 View commit details
    Browse the repository at this point in the history
  2. updated

    Wolf Zimmermann committed Jan 17, 2017
    Configuration menu
    Copy the full SHA
    70cc3f4 View commit details
    Browse the repository at this point in the history
  3. formatting

    Wolf Zimmermann committed Jan 17, 2017
    Configuration menu
    Copy the full SHA
    3b2001f View commit details
    Browse the repository at this point in the history
  4. formatting

    Wolf Zimmermann committed Jan 17, 2017
    Configuration menu
    Copy the full SHA
    16baab5 View commit details
    Browse the repository at this point in the history
  5. more formatting

    Wolf Zimmermann committed Jan 17, 2017
    Configuration menu
    Copy the full SHA
    80207b2 View commit details
    Browse the repository at this point in the history
  6. Even more formatting

    Wolf Zimmermann committed Jan 17, 2017
    Configuration menu
    Copy the full SHA
    5d6bfe4 View commit details
    Browse the repository at this point in the history

Commits on Jan 18, 2017

  1. merge

    Wolf Zimmermann committed Jan 18, 2017
    Configuration menu
    Copy the full SHA
    2f30e4e View commit details
    Browse the repository at this point in the history

Commits on Jan 23, 2017

  1. fixed error code

    Wolf Zimmermann committed Jan 23, 2017
    Configuration menu
    Copy the full SHA
    82e76a3 View commit details
    Browse the repository at this point in the history

Commits on Apr 26, 2017

  1. Merge branch 'master' of https://github.com/NVIDIA/DIGITS into dev

    Sync fork
    Wolf Zimmermann committed Apr 26, 2017
    Configuration menu
    Copy the full SHA
    8296a51 View commit details
    Browse the repository at this point in the history
  2. removed log file

    Wolf Zimmermann committed Apr 26, 2017
    Configuration menu
    Copy the full SHA
    edc101e View commit details
    Browse the repository at this point in the history

Commits on May 3, 2017

  1. fix travis errors

    Wolf Zimmermann committed May 3, 2017
    Configuration menu
    Copy the full SHA
    cd882e8 View commit details
    Browse the repository at this point in the history
  2. s_mem in generic jobs

    Wolf Zimmermann committed May 3, 2017
    Configuration menu
    Copy the full SHA
    cd6864d View commit details
    Browse the repository at this point in the history