Sample setup guide for CUDA programming for machine learning and gaming on macOS using a NVIDIA eGPU, plus references, tutorials and generalizations that will apply to most hardware.
The ports/cables/adapters listed are simply the ones from the reference rig.
You can probably mix and match hardware in n→∞
different ways using the very same (or similar) software steps.
Just make sure to have a reasonable idea about your likelihood of success before you go on a buying spree.
Tip: check eGPU.io's for a list of other reference implementations, Linux, macOS, Windows (pure or bootcamped) all included. A wealthy source of information.
Thunderbolt is a multi-purpose interface designed by Apple and Intel. It can transfer a data at high speed, thus is adequate for transferring data between your computer and (e)GPU.
Long story short about Thunderbolt versions:
- Thunderbolt 3 looks like a USB-C port and provides 40Gbit/s bandwidth and is what you have in the latest computers, as the MacBook Pro from 2016-2017—or the Dell XPS, Razer Blade Stealth, etc.
- Thunderbolt 2 looks like a Mini DisplayPort port and provides 20Gbit/s bandwidth and is what you have in the previous generation computers, as the MacBook Pro from 2015 and earlier.
- Thunderbolt 1 is very outdated (and slow) and not worth taking into consideration anymore for these purposes.
The sample setup depicted in this guide is not optimal, given its need of Thunderbolt 3 -> Thunderbolt 2 conversion. This conversion is necessary in this case since the laptop has a TB2 port and the eGPU enclosure has a TB3 port. For this reason you end up requiring additional cabling and possibly limiting throughput to TB2 bandwidth.
If you are curious about the performance drops for GPUs connected via Thunderbolt (versions) versus PCIexpress, see a sample comparison. As usual, your mileage may vary and take things with a grain of salt.
TL;DR on Thunderbolt:
- If you are not restricted to TB2 because of an older laptop, go for a TB3->TB3 solution.
- If you are restricted to TB2, you can choose to go TB2->TB2 (cheaper) or TB3->TB2 (upgradable in the long term).
- Apple notebook as Apple MacBook Pro 13 inch 2015 (2x Thunderbolt 2)
- eGPU as Akitio Node (1x Thunderbolt 3, 1x PCIe), other options can be checked at eGPU.io buyer's guide.
- NVIDIA Pascal GPUs as GTX 1080 Ti (1x Displayport, 1x PCIe + others), Maxwell work as well.
- External display as Dell UP3216Q (1x Displayport + others)
- Displayport cable (2x Displayport, comes with display)
- Thunderbolt 3 cable (2x Thunderbolt 3, comes with Node)
- Apple Thunderbolt 2 cable (2x Thunderbolt 2)
- Apple Thunderbolt 2<->3 adapter (1x Thunderbolt 2 + 1x Thunderbolt 3)
- macOS native installation:
- macOS SIP disabling
- NVIDIA Web drivers, just match your OS version with driver version.
- NVIDIA CUDA drivers
- cuDNN
- Please refer to the Step-by-step Tutorials.
- eGPU enabling & automation:
- If on macOS 13.1 (High Sierra or maybe later): NVIDIA eGPU v2
- If on macOS 12.6 (Sierra) or earlier: Automate eGPU script
- WINE: for running Windows games on macOS (or Linux).
- Wineskin Winery, or WineBottler or even possibly CrossOver: related to WINE above, to make things easier.
These tutorials are meant to be very (one could say overly) descriptive. Nonetheless, for reasons unknown, you may eventually find some slight variations. I will do my best to keep this updated with the intrincacies of the process as people report it.
Support is likely yet untested. The lack of NVIDIA web drivers compatible with Mojave is holding the community back. Please refer to the corresponding macOS 10.14 support issue.
Support has improved substantially in the last months since macOS 10.13.4, both because of Apple and the eGPU community.
Currently the best alternatives are:
Detailed step-by-step guides will be provided over time as I am able to test them.
- Install macOS High Sierra (works with v13.1–v13.3). Please note v13.4–v13.5+ support is still experimental.
- Check if macOS System Integrity Protection (SIP) is enabled and/or enable it:
- Boot the computer in recovery mode: press and hold
Command⌘ + R
when hearing the chime sound. - Open the
Terminal
application from the top menu. - Type
csrutil status
and pressEnter↩︎
to see if SIP is enabled. - If SIP is not enabled, type
csrutil enable
, pressEnter↩︎
and reboot. - Reboot.
- Boot the computer in recovery mode: press and hold
- Apply the High Sierra supplemental update (if necessary).
- Install the corresponding [NVIDIA Web Driver](NVIDIA Web drivers to your OS version (remember, SIP must be enabled here).
- Reboot and log in normally.
- Disable macOS System Integrity Protection (SIP):
- Boot the computer in recovery mode: press and hold
Command⌘ + R
when hearing the chime sound. - Open the
Terminal
application from the top menu. - Type
csrutil status
and pressEnter↩︎
to see if SIP is enabled. - If SIP is enabled, type
csrutil disable
, pressEnter↩︎
- Reboot.
- Boot the computer in recovery mode: press and hold
- Download and install the NVIDIA eGPU v2 package (if for whatever reason v2 does not work for you even after these exact steps, try v1).
- Shut down your computer.
- Now boot without the eGPU connected, log in normally, and again shut down your computer.
- Connect your eGPU enclosure and boot your computer:
- In most cases it should work with the eGPU connected before turning the computer on.
- In case you see an indefinite black screen, only plug the eGPU after the chime sound, just when the Apple logo shows up.
- Voilà!
- Install macOS Sierra or slightly earlier (at this point v12.6).
- Check if macOS System Integrity Protection (SIP) is enabled and/or enable it:
- Boot the computer in recovery mode: press and hold
Command⌘ + R
when hearing the chime sound. - Open the
Terminal
application from the top menu. - Type
csrutil status
and pressEnter↩︎
to see if SIP is enabled. - If SIP is not enabled, type
csrutil enable
, pressEnter↩︎
and reboot. - Reboot.
- Boot the computer in recovery mode: press and hold
- Apply any supplemental updates (if necessary).
- Install the corresponding [NVIDIA Web Driver](NVIDIA Web drivers to your OS version (remember, SIP must be enabled here).
- Reboot and log in normally.
- Disable macOS System Integrity Protection (SIP):
- Boot the computer in recovery mode: press and hold
Command⌘ + R
when hearing the chime sound. - Open the
Terminal
application from the top menu. - Type
csrutil status
and pressEnter↩︎
to see if SIP is enabled. - If SIP is enabled, type
csrutil disable
, pressEnter↩︎
- Reboot.
- Boot the computer in recovery mode: press and hold
- Download the automate-eGPU script
- Decompress it your computer if downloaded as a zip file.
- Run the script with administrator privileges:
- Open your
Terminal
application (or equivalent) from the application launcher, dock or Spotlight. - Type
chmod +x
plus a space. This will allow you to run the script. - Either drag & drop the
automate-eGPU.sh
file over the terminal or navigate to its directory. - You should now have something like
chmod +x /this/is/the/dir/automate-eGPU.sh
, then pressEnter↩︎
. 5. Now typesudo
plus a space. This will run the script with administrator privileges. - Again, either drag & drop the
automate-eGPU.sh
file over the terminal or navigate to its directory. - You should now have something like
sudo /this/is/the/dir/automate-eGPU.sh
, then pressEnter↩︎
. - Type in your user password and press
Enter↩︎
. - You will see outputs similar to these.
- Whenever asked to download NVIDIA drivers, type
y
andEnter↩︎
.
- Open your
- Reboot your computer normally.
- Turn on you eGPU enclosure and hot plug it into your computer via the respective Thunderbolt ports.
- Again, run the script with administrator privileges:
- Open your
Terminal
application (or equivalent) from the application launcher, dock or Spotlight. 2. Now typesudo
plus a space. This will run the script with administrator privileges. - Again, either drag & drop the
automate-eGPU.sh
file over the terminal or navigate to its directory. - You should now have something like
sudo /this/is/the/dir/automate-eGPU.sh
, then pressEnter↩︎
. - Type in your user password and press
Enter↩︎
. - You will see outputs similar to these.
- In some cases, you may have to add the
-a
flag after the script name (e.g.:sudo ./automate-eGPU.sh -a
). If not successful with-a
, run it again but now with the-m
flag (e.g.:sudo ./automate-eGPU.sh -m
), start over and skip this step.
- Open your
- Shut down your computer.
- Connect your eGPU enclosure and boot your computer:
- In most cases it should work with the eGPU connected before turning the computer on.
- In case you see an indefinite black screen, only plug the eGPU after the chime sound, just when the Apple logo shows up.
- Voilà!
For now I recommend NVIDIA's CUDA on macOS installation guide, as it is very extensive, detailed and up-to-date. So far you can trust it, but you might run into a few quirks covered below. As a rule-of-thumb, mind your macOS, Xcode, NVIDIA drivers, CUDA, cuDNN versions, as well as their compatibility with your machine learning library of choice and always research before updating any of them. This effectively means that at a certain point in time you will likely be running at least one of those (and eventually even all) in not-so-cutting-edge versions—to be fair, like in most of modern software development.
After installing CUDA, you may want to download and run CUDA-Z for testing and statistics.
Another way of detecting CUDA devices, if you have chosen to install NVIDIA CUDA Samples, is to use DeviceQuery
:
- Go to the directory where CUDA Samples where installed, e.g.:
~/NVIDIA_CUDA-9.0_Samples/1_Utilities/deviceQuery
. - On your terminal run:
make -f Makefile
to compile the application. - Once compiled, you can run it as
./deviceQuery
to identify CUDA-capable devices. Sample output.
nvcc fatal : The version ('xxxxx') of the host compiler ('Apple clang') is not supported
error:
This error is fairly common if you tend to keep your Xcode and/or C compiler up-to-date. It is caused by an incompatibility between the current Xcode (clang) compiler version installed and "active", and the versions supported by CUDA, which usually lags some months behind.
The fix is straightforward, but might be time consuming given Xcode humongous size:
- Check the latest version of Xcode supported by CUDA in NVIDIA's installation guide requirements for macOS.
- Download the above respective version in Apple's Developer downloads page.
- Unpack the zip file and rename the extracted
Xcode.app
adding the version it corresponds to, to avoid name collision and help you discern between the latest and last compatible versions, e.g.:Xcode_8.3.3.app
. - From you terminal, run:
sudo xcode-select -s /path/to/your/Xcode_8.3.3.app/Contents/Developer
(requires administrator privileges). - You can also use a simple bash script I provided to select the "active" Xcode version. Just make sure to edit it accordingly with your paths and filenames.
Segmentation fault: 11
error:
You might get these errors when running (even successfully) compiled CUDA programs or ML libraries. This is likely a NVIDIA mistake when naming and packaging the CUDA installation files and setting PATHs, as widely reported by TensorFlow users.
The fix in this case involves creating a symbolic link between the library called and the existing one, as well as setting correct PATHs:
- From your terminal, run:
sudo ln -sf /usr/local/cuda/lib/libcuda.dylib /usr/local/cuda/lib/libcuda.1.dylib
(requires administrator privileges). - Add the following PATHs to your environment profile (e.g.:
~/.bash_profile
,~/.zshrc
or equivalent):- Add
export CUDA_HOME=/usr/local/cuda
. - Add
export DYLD_LIBRARY_PATH="$CUDA_HOME/lib:$DYLD_LIBRARY_PATH"
. - Add
export PATH="$CUDA_HOME/bin:$PATH"
.
- Add
- Restart or reload your shell to apply the new PATHs.
MIT License
Copyright (c) 2017-2018 Marcelo Novaes
For more information, see LICENSE.