TensorFlow GPU 1.8 with macOS 10.13.6
I keep getting this error at step 8:
"-bash: cd: nccl_2.1.15-1+cuda9.1_x86_64: No such file or directory"
I'm not sure if this is correct, but I manually moved the contents of "nccl_osx_1.3.4" to /usr/local/nccl.
After I finished all the steps, it didn't work.
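For anyone else hitting the step 8 error: the cd only works if a folder with exactly that name was extracted. Here is a minimal sketch, assuming the NCCL archive was unpacked as a folder whose name starts with nccl_ (the helper name find_nccl_dir is made up for illustration), that looks up whichever folder is actually there instead of hard-coding the version:

```shell
# Hypothetical helper: print the first extracted NCCL folder found under a
# given parent directory (defaults to the current directory).
find_nccl_dir() {
    parent="${1:-.}"
    for dir in "$parent"/nccl_*; do
        # If nothing matches, the pattern stays literal and this test fails.
        if [ -d "$dir" ]; then
            printf '%s\n' "$dir"
            return 0
        fi
    done
    echo "no nccl_* directory found in $parent" >&2
    return 1
}
```

You could then run cd "$(find_nccl_dir .)" from the folder where you unpacked the download. Whether 1.3.4 or 2.1.15 is the right version for this setup is a separate question that this does not answer.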
I am also getting an error at step 10.
First of all, if I just type "chown -R $(whoami) *" I get many "permission denied" results, so I ran sudo chown -R $(whoami) * instead.
Then when I run
make -C 1_Utilities/deviceQuery
I get the following result: "make: Nothing to be done for `all' "
and running the binary gives:
-bash: ./bin/x86_64/darwin/release/deviceQuery: No such file or directory
Please follow this guide: https://gist.github.com/antoniopioricciardi/240921b16e32dcc67a779e11dfaade1d
Step 8 - I have to look into that (whether it's better to use nccl_2.1.15 or 1.3.4 as suggested elsewhere); however, what you did should be fine.
Step 10 - Okay
"make: Nothing to be done for `all' " - If I recall correctly, I got that message when I had already compiled that file and then ran make again. Try deleting the compiled files.
Please let me know how it goes.
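A rough sketch of what deleting the compiled files could look like, assuming the sample's Makefile has a clean target (the CUDA sample Makefiles normally do) and that you run it from the Samples folder. The name rebuild_sample is only illustrative:

```shell
# Rebuild one CUDA sample from scratch so that make cannot skip the build.
# Assumes the sample's Makefile provides a "clean" target.
rebuild_sample() {
    sample_dir="$1"
    # Only start the build if the clean step succeeded.
    make -C "$sample_dir" clean && make -C "$sample_dir"
}
```

Usage would be rebuild_sample 1_Utilities/deviceQuery. If make still says there is nothing to be done after a clean, the problem is probably not stale build output.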
I managed to install everything, and it is actually running and recognizing my eGPU, but TensorFlow is using almost none of the GPU's power. Do you guys have any ideas?
I keep getting the output below, but I'm not sure whether it is related.
The funny thing is that when I ran a much deeper and more complex model on Windows, each epoch took around 11 minutes.
In Activity Monitor, the GPU shows a small peak at the beginning but then keeps running with a very small load.
I also noticed that the GPU doesn't even get as hot as it did when I ran it on Windows.
Using TensorFlow backend.
2019-06-05 01:02:46.561247: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:859] OS X does not support NUMA - returning NUMA node zero
2019-06-05 01:02:46.561408: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1356] Found device 0 with properties:
name: GeForce GTX 1080 Ti major: 6 minor: 1 memoryClockRate(GHz): 1.6325
pciBusID: 0000:c4:00.0
totalMemory: 11.00GiB freeMemory: 9.10GiB
2019-06-05 01:02:46.561421: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1435] Adding visible gpu devices: 0
2019-06-05 01:02:46.884558: I tensorflow/core/common_runtime/gpu/gpu_device.cc:923] Device interconnect StreamExecutor with strength 1 edge matrix:
2019-06-05 01:02:46.884577: I tensorflow/core/common_runtime/gpu/gpu_device.cc:929] 0
2019-06-05 01:02:46.884582: I tensorflow/core/common_runtime/gpu/gpu_device.cc:942] 0: N
2019-06-05 01:02:46.884660: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1053] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 8801 MB memory) -> physical GPU (device: 0, name: GeForce GTX 1080 Ti, pci bus id: 0000:c4:00.0, compute capability: 6.1)
2019-06-05 01:02:47.039069: E tensorflow/core/grappler/clusters/utils.cc:127] Not found: TF GPU device with id 0 was not registered
2019-06-05 01:02:47.049827: E tensorflow/core/grappler/clusters/utils.cc:127] Not found: TF GPU device with id 0 was not registered
---------------Epoch 1---------------
Found 75750 images belonging to 101 classes.
Found 25250 images belonging to 101 classes.
0%| | 0/1184 [00:00<?, ?it/s]
2019-06-05 01:02:57.059551: E tensorflow/core/grappler/clusters/utils.cc:127] Not found: TF GPU device with id 0 was not registered
2019-06-05 01:02:57.145434: E tensorflow/core/grappler/clusters/utils.cc:127] Not found: TF GPU device with id 0 was not registered
2019-06-05 01:02:57.250595: E tensorflow/core/grappler/clusters/utils.cc:127] Not found: TF GPU device with id 0 was not registered
0%|9 | 5/1184 [00:19<1:19:39, 4.05s/it]
Good work on putting this together. I am no expert, but I am trying to set up my 2017 MBP 15" / Nvidia 1080 Ti / Mantiz / 10.13.6 for use with TensorFlow. I have a working hardware setup and a fully functional GPU.
I too am having issues with some parts of your installation. Most I have overcome, but I am stuck on Step 10 as @btc2lambo was (not sure if he ever got past it):
chown -R $(whoami) * # this works if you run it with sudo; otherwise it errors due to lack of permissions.
make -C 1_Utilities/deviceQuery # this errors with "nothing to be done for 'all' "
./bin/x86_64/darwin/release/deviceQuery # I assume this line should form part of the above make command, but when I include it I get the error " No rule to make target 'bin/x86_64/darwin/release/deviceQuey'. Stop."
The Makefile in the 1_Utilities/deviceQuery folder already exists and has content, so I'm not entirely sure why it's failing.
What am I missing?
**** Update to previous post ****
Having delved a little deeper and thought about this, I can now understand what is supposed to happen:
• cd to Developer/Nvidia/Cuda-9.1/Samples - takes the terminal into this working directory
• chown - changes ownership to the current user. This does work without sudo of course, as we are not changing 'system' files
• make - calls the Makefile in the 1_Utilities/deviceQuery folder (which exists and has what appears to be complete script content)
• ./bin/x86_64/darwin/release/deviceQuery - should run the executable that the make command should have built.
However, the make command is NOT building the executable and is NOT creating the new directory path, so there is nothing to run.
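The steps above can be sketched as a single function that fails loudly when make reports success but no executable appears, which is the situation described here. This is only a sketch: build_and_run_device_query is a hypothetical name, the samples path is passed in as an argument rather than assumed, and the chown step is left out because it only needs to happen once.

```shell
# Hypothetical helper: build deviceQuery inside the given samples folder and
# run it, reporting clearly if the build produced no executable.
# The parentheses run the body in a subshell, so the caller's working
# directory is left unchanged.
build_and_run_device_query() (
    samples_dir="$1"
    binary=./bin/x86_64/darwin/release/deviceQuery

    cd "$samples_dir" || exit 1
    make -C 1_Utilities/deviceQuery || exit 1

    # make can exit successfully without having built anything.
    if [ ! -x "$binary" ]; then
        echo "make succeeded but $binary was not created" >&2
        exit 1
    fi
    "$binary"
)
```

Called as build_and_run_device_query followed by the path to your Samples folder, it separates "make failed" from "make did nothing", which the plain three-line version does not.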
Something is up with the Makefile in the 1_Utilities/deviceQuery folder, because it still states "Nothing to be done for 'all'" and so it's not building the executable.
Or I have done something really wrong.
**** Update to previous post ****
I noticed a Makefile in the root Samples folder and ran 'sudo make -C ./' - I probably shouldn't have, as I now appear to have installed ALL samples 🙂
The upside is I now have a 'bin' in the root of the samples folder and every sample has been built (as far as I can tell). Except, that is, 'deviceQuery' - there is no file in the 'bin/x86_64/darwin/release' folder as there is for every other sample.
I have managed to call the deviceQueryDrv from this folder and it outputs details of the driver as one might expect.
So why is deviceQuery not working, or rather reporting that there is nothing to be done for 'all'?
Hope you can help and, moreover, answer whether this is at all critical - which I doubt.
Hey guys, is this still the required way? Can we not use the latest Xcode / 418.168 CUDA / 3184.108.40.206.40.105 GPU driver?
I have the above installed, but for some reason I get a "no CUDA-capable device" error.
If I follow the guide above, I get a segfault.
Any advice will be greatly appreciated.