Nvidia Driver not loading MacPro Ubuntu 18.04
 
Notifications
Clear all

Nvidia Driver not loading MacPro Ubuntu 18.04  

  RSS

(@antoinedau)
New Member
Joined: 3 months ago
 

Hi,

I'm trying to install CUDA through the .deb on Mac Pro Late 2013 running Ubuntu 18.04 to do some ML. The mac is with a dual boot OSX/Ubuntu with reFind.

I have a RTX 2080 Ti in a Razer Core X which is recognized by my system but it seems that the driver is not loading...

After following the CUDA installation guide for Linux, I get :

nvidia-smi

NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

and

~$ nvidia-settings

ERROR: NVIDIA driver is not loaded


ERROR: Unable to load info from any available system

Eventhough,

~$ lspci | grep -i nvidia
19:00.0 VGA compatible controller: NVIDIA Corporation TU102 [GeForce RTX 2080 Ti Rev. A] (rev a1)

Looking at the bug report from nvidia I also found the following:

dmesg
[ 378.103711] NVRM: This PCI I/O region assigned to your NVIDIA device is invalid:
NVRM: BAR1 is 0M @ 0x0 (PCI:0000:19:00.0)
[ 378.103714] NVRM: The system BIOS may have misconfigured your GPU.
[ 378.103726] nvidia: probe of 0000:19:00.0 failed with error -1
[ 378.103785] NVRM: The NVIDIA probe routine failed for 1 device(s).
[ 378.103787] NVRM: None of the NVIDIA devices were initialized.
[ 378.104219] nvidia-nvlink: Unregistered the Nvlink Core, major device number 237
[ 378.806726] nvidia-nvlink: Nvlink Core is being initialized, major device number 237
[ 378.807642] NVRM: This PCI I/O region assigned to your NVIDIA device is invalid:
NVRM: BAR1 is 0M @ 0x0 (PCI:0000:19:00.0)
[ 378.807645] NVRM: The system BIOS may have misconfigured your GPU.
[ 378.807653] nvidia: probe of 0000:19:00.0 failed with error -1
[ 378.807699] NVRM: The NVIDIA probe routine failed for 1 device(s).
[ 378.807700] NVRM: None of the NVIDIA devices were initialized.
[ 378.807984] nvidia-nvlink: Unregistered the Nvlink Core, major device number 237
[ 379.506203] nvidia-nvlink: Nvlink Core is being initialized, major device number 237
[ 379.507025] NVRM: This PCI I/O region assigned to your NVIDIA device is invalid:
NVRM: BAR1 is 0M @ 0x0 (PCI:0000:19:00.0)
[ 379.507026] NVRM: The system BIOS may have misconfigured your GPU.
[ 379.507033] nvidia: probe of 0000:19:00.0 failed with error -1
[ 379.507076] NVRM: The NVIDIA probe routine failed for 1 device(s).
[ 379.507077] NVRM: None of the NVIDIA devices were initialized.
...


and 

/usr/bin/lspci -d "10de:*" -v -xxx

19:00.0 VGA compatible controller: NVIDIA Corporation TU102 [GeForce RTX 2080 Ti Rev. A] (rev a1) (prog-if 00 [VGA controller])
Subsystem: eVga.com. Corp. Device 2484
Flags: fast devsel, IRQ 27
Memory at a1000000 (32-bit, non-prefetchable) [size=16M]
Memory at <ignored> (64-bit, prefetchable)
Memory at c0000000 (64-bit, prefetchable) [size=32M]
I/O ports at 5000 [size=128]
Expansion ROM at a2000000 [disabled] [size=512K]
Capabilities: [60] Power Management version 3
Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
Capabilities: [78] Express Legacy Endpoint, MSI 00
Capabilities: [100] Virtual Channel
Capabilities: [250] Latency Tolerance Reporting
Capabilities: [258] L1 PM Substates
Capabilities: [128] Power Budgeting <?>
Capabilities: [420] Advanced Error Reporting
Capabilities: [600] Vendor Specific Information: ID=0001 Rev=1 Len=024 <?>
Capabilities: [900] #19
Capabilities: [bb0] #15
Kernel modules: nvidiafb, nouveau, nvidia_drm, nvidia

Obviously, something's going wrong with memory... but I don't know what or how to fix it.

The mac pro has 16Gb RAM. Probably it's worth mentioning.

Any help would be very much appreciated!
Thank you.

MacPro 6.1 - RTX 2080Ti - Razer Core X - Ubuntu 18.04 (next to a Mac OS 10.15 partition)


ReplyQuote
nu_ninja
(@nu_ninja)
Reputable Member
Joined: 2 years ago
 

I'd recommend installing the nvidia driver package through the Additional Drivers tab in the Software & Updates app in Ubuntu rather than through a .deb to make sure everything stays up to date and complete.

The main issue though, seems to be a PCIe resource allocation issue. This is similar in nature to error 12 in windows. Can you try booting up with the eGPU attached and on and the following kernel parameters:

pcie_ports=native pci=assign-busses,nocrs,realloc iommu=on intel_iommu=on

Mid-2012 13" Macbook Pro (MacBookPro9,2) TB1 -> RX 460/560 (AKiTiO Node/Thunder2)
+ macOS 10.15+Win10 + Linux Mint 19.1

 
2012 13" MacBook Pro [3rd,2C,M] + RX 460 @ 10Gbps-TB1 (AKiTiO Thunder2) + macOS 10.14.4 [build link]  


ReplyQuote
(@antoinedau)
New Member
Joined: 3 months ago
 

@nu_ninja

I did what you suggested (purged previous installs, made a new one from Software & Updates and added the kernel parameters)

but the result is still the same.

 

MacPro 6.1 - RTX 2080Ti - Razer Core X - Ubuntu 18.04 (next to a Mac OS 10.15 partition)


ReplyQuote
(@vandmmages)
New Member
Joined: 9 months ago
 

@antoinedau, Hi ever get a fix?

 

i have a macboo pro 10.1 wiht a Geforce 1070 -- Razer-x

 

same error as you

NVRM: The system BIOS may have misconfigured your GPU.

I am on Ubuntu 20.4.

 

Macbook Pro Retina 10.1 (Mid 2012) . MACOS Catalina
Razer -X eGPU enclosure
AMD RTX 570 or if testing needed Nvida GTX1070
LG 34UM69G-B connected to DP of Razer-X


ReplyQuote
(@antoinedau)
New Member
Joined: 3 months ago
 

I got it working on a mac book air afterwise.

If I were you I'd try downgrading to 18.04 since it is LTS. But that's just a guess...

I "solved" my problem by finding out that it was unsolvable, since the firmware of the mac pro (the trash can, not your macBOOk pro) explictly excludes the turing architecture of the RTX series...

So I don't think I was having your problem.

MacPro 6.1 - RTX 2080Ti - Razer Core X - Ubuntu 18.04 (next to a Mac OS 10.15 partition)


ReplyQuote