allenai/objaverse-rendering

How to run xserver on Runpod docker image


information

  • tested on Runpod GPU docker image instance
    • 8x RTX 4090, 128 vCPU, 502 GB RAM
    • runpod/pytorch:2.2.0-py3.10-cuda12.1.1-devel-ubuntu22.04
    • 400 GB Disk, 400 GB Pod Volume
  • installation was successful

reproduction of the error

Starting the X server on the headless machine fails with the following error:

(objaverse) root@1577f2c65f68:~/objaverse-rendering# python3 scripts/start_xserver.py start
Error: error with command 'Xorg -quiet -maxclients 1024 -noreset +extension GLX +extension RANDR +extension RENDER -logfile /var/log/ai2thor-xorg.0.log -config /tmp/ai2thor-xorg.conf :0'
xf86EnableIO: failed to enable I/O ports 0000-03ff (Operation not permitted)
vesa: Ignoring device with a bound kernel driver
(EE) 
Fatal server error:
(EE) no screens found(EE) 
(EE) 
Please consult the The X.Org Foundation support 
         at http://wiki.x.org
 for help. 
(EE) Please also check the log file at "/var/log/ai2thor-xorg.0.log" for additional information.
(EE) 
(EE) Server terminated with error (1). Closing log file.

The contents of /var/log/ai2thor-xorg.0.log are below:

(objaverse) root@1577f2c65f68:/var/log# cat ai2thor-xorg.0.log 
[3700561.532] 
X.Org X Server 1.21.1.4
X Protocol Version 11, Revision 0
[3700561.532] Current Operating System: Linux 1577f2c65f68 5.15.0-91-generic #101-Ubuntu SMP Tue Nov 14 13:30:08 UTC 2023 x86_64
[3700561.532] Kernel command line: BOOT_IMAGE=/vmlinuz-5.15.0-91-generic root=/dev/mapper/ubuntu--vg-ubuntu--lv ro systemd.unified_cgroup_hierarchy=false
[3700561.532] xorg-server 2:21.1.4-2ubuntu1.7~22.04.10 (For technical support please see http://www.ubuntu.com/support) 
[3700561.532] Current version of pixman: 0.40.0
[3700561.532]   Before reporting problems, check http://wiki.x.org
        to make sure that you have the latest version.
[3700561.532] Markers: (--) probed, (**) from config file, (==) default setting,
        (++) from command line, (!!) notice, (II) informational,
        (WW) warning, (EE) error, (NI) not implemented, (??) unknown.
[3700561.532] (++) Log file: "/var/log/ai2thor-xorg.0.log", Time: Mon Apr 15 06:45:08 2024
[3700561.532] (++) Using config file: "/tmp/ai2thor-xorg.conf"
[3700561.532] (==) Using system config directory "/usr/share/X11/xorg.conf.d"
[3700561.532] (==) ServerLayout "Layout0"
[3700561.532] (**) |-->Screen "Screen0" (0)
[3700561.532] (**) |   |-->Monitor "<default monitor>"
[3700561.532] (**) |   |-->Device "Device0"
[3700561.532] (==) No monitor specified for screen "Screen0".
        Using a default monitor configuration.
[3700561.532] (**) |-->Screen "Screen1" (1)
[3700561.532] (**) |   |-->Monitor "<default monitor>"
[3700561.532] (**) |   |-->Device "Device1"
[3700561.532] (==) No monitor specified for screen "Screen1".
        Using a default monitor configuration.
[3700561.532] (**) |-->Screen "Screen2" (2)
[3700561.532] (**) |   |-->Monitor "<default monitor>"
[3700561.532] (**) |   |-->Device "Device2"
[3700561.532] (==) No monitor specified for screen "Screen2".
        Using a default monitor configuration.
[3700561.532] (**) |-->Screen "Screen3" (3)
[3700561.532] (**) |   |-->Monitor "<default monitor>"
[3700561.533] (**) |   |-->Device "Device3"
[3700561.533] (==) No monitor specified for screen "Screen3".
        Using a default monitor configuration.
[3700561.533] (**) |-->Screen "Screen4" (4)
[3700561.533] (**) |   |-->Monitor "<default monitor>"
[3700561.533] (**) |   |-->Device "Device4"
[3700561.533] (==) No monitor specified for screen "Screen4".
        Using a default monitor configuration.
[3700561.533] (**) |-->Screen "Screen5" (5)
[3700561.533] (**) |   |-->Monitor "<default monitor>"
[3700561.533] (**) |   |-->Device "Device5"
[3700561.533] (==) No monitor specified for screen "Screen5".
        Using a default monitor configuration.
[3700561.533] (**) |-->Screen "Screen6" (6)
[3700561.533] (**) |   |-->Monitor "<default monitor>"
[3700561.533] (**) |   |-->Device "Device6"
[3700561.533] (==) No monitor specified for screen "Screen6".
        Using a default monitor configuration.
[3700561.533] (**) |-->Screen "Screen7" (7)
[3700561.533] (**) |   |-->Monitor "<default monitor>"
[3700561.533] (**) |   |-->Device "Device7"
[3700561.533] (==) No monitor specified for screen "Screen7".
        Using a default monitor configuration.
[3700561.533] (==) Automatically adding devices
[3700561.533] (==) Automatically enabling devices
[3700561.533] (==) Automatically adding GPU devices
[3700561.533] (==) Automatically binding GPU devices
[3700561.533] (++) Max clients allowed: 1024, resource mask: 0x7ffff
[3700561.533] (WW) The directory "/usr/share/fonts/X11/cyrillic" does not exist.
[3700561.533]   Entry deleted from font path.
[3700561.533] (WW) The directory "/usr/share/fonts/X11/100dpi/" does not exist.
[3700561.533]   Entry deleted from font path.
[3700561.533] (WW) The directory "/usr/share/fonts/X11/75dpi/" does not exist.
[3700561.533]   Entry deleted from font path.
[3700561.533] (WW) The directory "/usr/share/fonts/X11/Type1" does not exist.
[3700561.533]   Entry deleted from font path.
[3700561.533] (WW) The directory "/usr/share/fonts/X11/100dpi" does not exist.
[3700561.533]   Entry deleted from font path.
[3700561.533] (WW) The directory "/usr/share/fonts/X11/75dpi" does not exist.
[3700561.533]   Entry deleted from font path.
[3700561.533] (==) FontPath set to:
        /usr/share/fonts/X11/misc,
        built-ins
[3700561.533] (==) ModulePath set to "/usr/lib/xorg/modules"
[3700561.533] (II) The server relies on udev to provide the list of input devices.
        If no devices become available, reconfigure udev or disable AutoAddDevices.
[3700561.533] (II) Loader magic: 0x562892579020
[3700561.533] (II) Module ABI versions:
[3700561.533]   X.Org ANSI C Emulation: 0.4
[3700561.533]   X.Org Video Driver: 25.2
[3700561.533]   X.Org XInput driver : 24.4
[3700561.533]   X.Org Server Extension : 10.0
[3700561.533] (EE) dbus-core: error connecting to system bus: org.freedesktop.DBus.Error.FileNotFound (Failed to connect to socket /run/dbus/system_bus_socket: No such file or directory)
[3700561.536] (II) xfree86: Adding drm device (/dev/dri/card4)
[3700561.536] (II) Platform probe for /sys/devices/pci0000:00/0000:00:01.1/0000:01:00.0/drm/card4
[3700561.537] (II) xfree86: Adding drm device (/dev/dri/card3)
[3700561.537] (II) Platform probe for /sys/devices/pci0000:20/0000:20:03.1/0000:25:00.0/drm/card3
[3700561.537] (II) xfree86: Adding drm device (/dev/dri/card2)
[3700561.537] (II) Platform probe for /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/drm/card2
[3700561.537] (II) xfree86: Adding drm device (/dev/dri/card1)
[3700561.537] (II) Platform probe for /sys/devices/pci0000:60/0000:60:03.1/0000:61:00.0/drm/card1
[3700561.538] (II) xfree86: Adding drm device (/dev/dri/card0)
[3700561.538] (II) Platform probe for /sys/devices/pci0000:60/0000:60:05.2/0000:62:00.0/0000:63:00.0/drm/card0
[3700561.538] (II) xfree86: Adding drm device (/dev/dri/card8)
[3700561.538] (II) Platform probe for /sys/devices/pci0000:80/0000:80:01.1/0000:81:00.0/drm/card8
[3700561.538] (II) xfree86: Adding drm device (/dev/dri/card7)
[3700561.538] (II) Platform probe for /sys/devices/pci0000:a0/0000:a0:03.1/0000:a1:00.0/drm/card7
[3700561.538] (II) xfree86: Adding drm device (/dev/dri/card6)
[3700561.538] (II) Platform probe for /sys/devices/pci0000:c0/0000:c0:01.1/0000:c1:00.0/drm/card6
[3700561.539] (II) xfree86: Adding drm device (/dev/dri/card5)
[3700561.539] (II) Platform probe for /sys/devices/pci0000:e0/0000:e0:03.1/0000:e1:00.0/drm/card5
[3700561.556] (--) PCI: (1@0:0:0) 10de:2684:10b0:f297 rev 161, Mem @ 0xfa000000/16777216, 0x38090000000/268435456, 0x380a0000000/33554432, I/O @ 0x00003000/128, BIOS @ 0x????????/524288
[3700561.556] (--) PCI: (37@0:0:0) 10de:2684:10b0:f297 rev 161, Mem @ 0xc6000000/16777216, 0x28060000000/268435456, 0x28070000000/33554432, I/O @ 0x00004000/128, BIOS @ 0x????????/524288
[3700561.556] (--) PCI: (65@0:0:0) 10de:2684:10b0:f297 rev 161, Mem @ 0xcc000000/16777216, 0x20030000000/268435456, 0x20040000000/33554432, I/O @ 0x00005000/128, BIOS @ 0x????????/524288
[3700561.556] (--) PCI: (97@0:0:0) 10de:2684:10b0:f297 rev 161, Mem @ 0xf4000000/16777216, 0x18000000000/268435456, 0x18010000000/33554432, I/O @ 0x00008000/128, BIOS @ 0x????????/524288
[3700561.556] (--) PCI:*(99@0:0:0) 1a03:2000:1849:2000 rev 65, Mem @ 0xf2000000/16777216, 0xf3000000/131072, I/O @ 0x00007000/128, BIOS @ 0x????????/131072
[3700561.556] (--) PCI: (129@0:0:0) 10de:2684:10b0:f297 rev 161, Mem @ 0xb2000000/16777216, 0x58150000000/268435456, 0x58160000000/33554432, I/O @ 0x0000b000/128, BIOS @ 0x????????/524288
[3700561.556] (--) PCI: (161@0:0:0) 10de:2684:10b0:f297 rev 161, Mem @ 0xb8000000/16777216, 0x50120000000/268435456, 0x50130000000/33554432, I/O @ 0x0000c000/128, BIOS @ 0x????????/524288
[3700561.556] (--) PCI: (193@0:0:0) 10de:2684:10b0:f297 rev 161, Mem @ 0xbc000000/16777216, 0x480f0000000/268435456, 0x48100000000/33554432, I/O @ 0x0000d000/128, BIOS @ 0x????????/524288
[3700561.556] (--) PCI: (225@0:0:0) 10de:2684:10b0:f297 rev 161, Mem @ 0xc2000000/16777216, 0x400c0000000/268435456, 0x400d0000000/33554432, I/O @ 0x0000f000/128, BIOS @ 0x????????/524288
[3700561.556] (II) LoadModule: "glx"
[3700561.557] (II) Loading /usr/lib/xorg/modules/extensions/libglx.so
[3700561.557] (II) Module glx: vendor="X.Org Foundation"
[3700561.557]   compiled for 1.21.1.4, module version = 1.0.0
[3700561.557]   ABI class: X.Org Server Extension, version 10.0
[3700561.557] (II) LoadModule: "nvidia"
[3700561.558] (WW) Warning, couldn't open module nvidia
[3700561.558] (EE) Failed to load module "nvidia" (module does not exist, 0)
[3700561.558] (==) Matched ast as autoconfigured driver 0
[3700561.558] (==) Matched modesetting as autoconfigured driver 1
[3700561.558] (==) Matched fbdev as autoconfigured driver 2
[3700561.558] (==) Matched vesa as autoconfigured driver 3
[3700561.558] (==) Assigned the driver to the xf86ConfigLayout
[3700561.558] (II) LoadModule: "ast"
[3700561.558] (WW) Warning, couldn't open module ast
[3700561.558] (EE) Failed to load module "ast" (module does not exist, 0)
[3700561.558] (II) LoadModule: "modesetting"
[3700561.558] (II) Loading /usr/lib/xorg/modules/drivers/modesetting_drv.so
[3700561.558] (II) Module modesetting: vendor="X.Org Foundation"
[3700561.558]   compiled for 1.21.1.4, module version = 1.21.1
[3700561.558]   Module class: X.Org Video Driver
[3700561.558]   ABI class: X.Org Video Driver, version 25.2
[3700561.558] (II) LoadModule: "fbdev"
[3700561.558] (II) Loading /usr/lib/xorg/modules/drivers/fbdev_drv.so
[3700561.558] (II) Module fbdev: vendor="X.Org Foundation"
[3700561.558]   compiled for 1.21.1.3, module version = 0.5.0
[3700561.558]   Module class: X.Org Video Driver
[3700561.558]   ABI class: X.Org Video Driver, version 25.2
[3700561.558] (II) LoadModule: "vesa"
[3700561.558] (II) Loading /usr/lib/xorg/modules/drivers/vesa_drv.so
[3700561.558] (II) Module vesa: vendor="X.Org Foundation"
[3700561.558]   compiled for 1.21.1.3, module version = 2.5.0
[3700561.558]   Module class: X.Org Video Driver
[3700561.558]   ABI class: X.Org Video Driver, version 25.2
[3700561.558] (II) modesetting: Driver for Modesetting Kernel Drivers: kms
[3700561.558] (II) FBDEV: driver for framebuffer: fbdev
[3700561.558] (II) VESA: driver for VESA chipsets: vesa
[3700561.558] xf86EnableIO: failed to enable I/O ports 0000-03ff (Operation not permitted)
[3700561.558] (EE) open /dev/dri/card0: No such file or directory
[3700561.558] (WW) Falling back to old probe method for modesetting
[3700561.558] (EE) open /dev/dri/card0: No such file or directory
[3700561.558] (II) Loading sub module "fbdevhw"
[3700561.558] (II) LoadModule: "fbdevhw"
[3700561.558] (II) Loading /usr/lib/xorg/modules/libfbdevhw.so
[3700561.558] (II) Module fbdevhw: vendor="X.Org Foundation"
[3700561.558]   compiled for 1.21.1.4, module version = 0.0.2
[3700561.558]   ABI class: X.Org Video Driver, version 25.2
[3700561.558] (EE) Unable to find a valid framebuffer device
[3700561.558] (WW) Falling back to old probe method for fbdev
[3700561.558] (II) Loading sub module "fbdevhw"
[3700561.558] (II) LoadModule: "fbdevhw"
[3700561.558] (II) Loading /usr/lib/xorg/modules/libfbdevhw.so
[3700561.558] (II) Module fbdevhw: vendor="X.Org Foundation"
[3700561.558]   compiled for 1.21.1.4, module version = 0.0.2
[3700561.558]   ABI class: X.Org Video Driver, version 25.2
[3700561.558] (EE) open /dev/fb0: No such file or directory
[3700561.558] vesa: Ignoring device with a bound kernel driver
[3700561.558] (WW) VGA arbiter: cannot open kernel arbiter, no multi-card support
[3700561.558] (EE) Screen 0 deleted because of no matching config section.
[3700561.558] (II) UnloadModule: "modesetting"
[3700561.558] (EE) Screen 0 deleted because of no matching config section.
[3700561.558] (II) UnloadModule: "fbdev"
[3700561.558] (II) UnloadSubModule: "fbdevhw"
[3700561.558] (EE) Screen 0 deleted because of no matching config section.
[3700561.558] (II) UnloadModule: "vesa"
[3700561.558] (EE) Device(s) detected, but none match those in the config file.
[3700561.558] (EE) 
Fatal server error:
[3700561.558] (EE) no screens found(EE) 
[3700561.558] (EE) 
Please consult the The X.Org Foundation support 
         at http://wiki.x.org
 for help. 
[3700561.558] (EE) Please also check the log file at "/var/log/ai2thor-xorg.0.log" for additional information.
[3700561.558] (EE) 
[3700561.558] (EE) Server terminated with error (1). Closing log file.
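A log of this length is easier to triage if only the module-load failures and (EE) lines are pulled out. The following is a minimal sketch, not part of objaverse-rendering; the function names `failed_modules` and `error_lines` are hypothetical, and the sample text is a handful of lines copied from the log above.

```python
import re

# Matches lines such as:
# [3700561.558] (EE) Failed to load module "nvidia" (module does not exist, 0)
FAILED_MODULE = re.compile(r'\(EE\) Failed to load module "([^"]+)"')

# Matches the leading "[seconds.millis]" timestamp Xorg puts on most log lines.
TIMESTAMP = re.compile(r"^\[\s*\d+\.\d+\]\s*")


def failed_modules(log_text: str) -> list[str]:
    """Return the names of modules Xorg reported it could not load, in order."""
    return FAILED_MODULE.findall(log_text)


def error_lines(log_text: str) -> list[str]:
    """Return non-empty lines whose first marker is (EE), timestamps removed."""
    result = []
    for line in log_text.splitlines():
        stripped = TIMESTAMP.sub("", line).strip()
        # Skip the marker legend (starts with another marker) and bare "(EE)" lines.
        if stripped.startswith("(EE)") and stripped != "(EE)":
            result.append(stripped)
    return result


# A few lines copied from the log in this issue.
SAMPLE = """[3700561.532] Markers: (--) probed, (**) from config file, (==) default setting,
        (WW) warning, (EE) error, (NI) not implemented, (??) unknown.
[3700561.557] (II) LoadModule: "nvidia"
[3700561.558] (WW) Warning, couldn't open module nvidia
[3700561.558] (EE) Failed to load module "nvidia" (module does not exist, 0)
[3700561.558] (II) LoadModule: "ast"
[3700561.558] (EE) Failed to load module "ast" (module does not exist, 0)
[3700561.558] (EE) open /dev/dri/card0: No such file or directory
[3700561.558] (EE)
[3700561.558] (EE) no screens found(EE)
"""

print("modules that failed to load:", failed_modules(SAMPLE))
for entry in error_lines(SAMPLE):
    print(entry)
```

To run it against the real file, read the text with `open("/var/log/ai2thor-xorg.0.log", errors="replace").read()` and pass it to the same two functions.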

I had the same problem on a Google Cloud Platform VM: the X server could not find a screen. To solve it, I booted a fresh Ubuntu image and installed the NVIDIA drivers manually, following this guide:
https://cloud.google.com/compute/docs/gpus/install-drivers-gpu

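I suspect that the Docker image you used doesn't include the NVIDIA display driver for Xorg. Your log points the same way: Failed to load module "nvidia" (module does not exist, 0).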
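One way to test that suspicion inside the container is to look for the NVIDIA Xorg driver module on disk. The sketch below assumes that driver modules follow the `<name>_drv.so` naming seen in the log (for example `modesetting_drv.so`) and that they live somewhere under the ModulePath the log reports, `/usr/lib/xorg/modules`. The helper name `find_driver_modules` is hypothetical. An empty result is a hint rather than proof, since some installations keep the NVIDIA module in a different directory and add it through the Xorg configuration.

```python
from pathlib import Path


def find_driver_modules(module_path: str, driver: str) -> list[Path]:
    """Search module_path recursively for '<driver>_drv.so'.

    Returns a sorted list of matching paths, or an empty list if the
    directory does not exist or contains no such module.
    """
    root = Path(module_path)
    if not root.is_dir():
        return []
    return sorted(root.rglob(f"{driver}_drv.so"))


if __name__ == "__main__":
    # ModulePath taken from the log in this issue; adjust if yours differs.
    module_path = "/usr/lib/xorg/modules"
    for name in ("nvidia", "modesetting"):
        hits = find_driver_modules(module_path, name)
        if hits:
            print(f"{name}: found {', '.join(str(p) for p in hits)}")
        else:
            print(f"{name}: no {name}_drv.so under {module_path}")
```

If `nvidia` comes back empty while `modesetting` is found, that matches the log: Xorg can load its stock drivers but has no NVIDIA module available for the devices named in /tmp/ai2thor-xorg.conf.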