Hi Pat, thanks for your reply!
Does the drive still get excessively warm when connected and the file system is mounted, or does this only happen after the drive has been ejected from the system and the drive letter is no longer assigned?
The drive begins heating from the moment it’s plugged in, and doesn’t stop heating until it’s unplugged. For example if I plug it in and leave it alone, it will heat up. If I plug it in and immediately “Power off this disk” using gnome-disks, it will also continue to heat up.
This likely indicates the operating system is accessing the drive for some reason.
This would make sense, but it’s occurring even after ejecting the drive on a fresh install of Ubuntu. I don’t have another device to test it on at the moment (only this Dell XPS 9360). I also tried disabling Thunderbolt in the BIOS and it has not made a difference. I have also tried using the USB-A cable instead with a different port, and it also does not make a difference.
I built smartmontools from src, and it prints output, then freezes for about a minute, then prints an error message. When this happens, it also “ejects” the drive (no longer available under /dev/sda), but the drive continues to heat up. The drive does not seem to be exceeding the 78 degree warning threshold though, which is a good sign.
# ./smartctl -a /dev/sda
smartctl 7.2 (build date Jul 28 2020) [x86_64-linux-5.4.0-42-generic] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Number: THNSN5256GPUK NVMe TOSHIBA 256GB
Serial Number: [snip]
Firmware Version: 5KDA4103
PCI Vendor/Subsystem ID: 0x1179
IEEE OUI Identifier: [snip]
Controller ID: 0
Number of Namespaces: 1
Namespace 1 Size/Capacity: 256,060,514,304 [256 GB]
Namespace 1 Formatted LBA Size: 512
Namespace 1 IEEE EUI-64: [snip]
Local Time is: Mon Jul 27 20:54:58 2020 [snip]
Firmware Updates (0x02): 1 Slot
Optional Admin Commands (0x0017): Security Format Frmw_DL Self_Test
Optional NVM Commands (0x001e): Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat
Warning Comp. Temp. Threshold: 78 Celsius
Critical Comp. Temp. Threshold: 82 Celsius
Supported Power States
St Op Max Active Idle RL RT WL WT Ent_Lat Ex_Lat
0 + 6.00W - - 0 0 0 0 0 0
1 + 2.40W - - 1 1 1 1 0 0
2 + 1.90W - - 2 2 2 2 0 0
3 - 0.0120W - - 3 3 3 3 5000 25000
4 - 0.0060W - - 4 4 4 4 100000 70000
Supported LBA Sizes (NSID 0x1)
Id Fmt Data Metadt Rel_Perf
0 + 512 0 2
1 - 4096 0 1
=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
SMART/Health Information (NVMe Log 0x02)
Critical Warning: 0x00
Temperature: 67 Celsius
Available Spare: 100%
Available Spare Threshold: 50%
Percentage Used: 19%
Data Units Read: 24,274,927 [12.4 TB]
Data Units Written: 20,339,692 [10.4 TB]
Host Read Commands: 764,812,557
Host Write Commands: 593,173,812
Controller Busy Time: 2,439
Power Cycles: 4,036
Power On Hours: 9,375
Unsafe Shutdowns: 421
Media and Data Integrity Errors: 0
Error Information Log Entries: 0
Warning Comp. Temperature Time: 0
Critical Comp. Temperature Time: 0
Temperature Sensor 1: 67 Celsius
Read Error Information Log failed: Connection timed out
Edit: Here is the dmesg log if it gives you any useful information:
[ 3947.199911] usb 4-1: new SuperSpeedPlus Gen 2 USB device number 2 using xhci_hcd
[ 3947.233754] usb 4-1: New USB device found, idVendor=0bda, idProduct=9210, bcdDevice=31.00
[ 3947.233761] usb 4-1: New USB device strings: Mfr=1, Product=2, SerialNumber=3
[ 3947.233766] usb 4-1: Product: RTL9210
[ 3947.233769] usb 4-1: Manufacturer: Realtek
[ 3947.233772] usb 4-1: SerialNumber: 012345678903
[ 3947.240217] usb 4-1: Enable of device-initiated U1 failed.
[ 3947.241930] usb 4-1: Enable of device-initiated U2 failed.
[ 3947.303202] usb 4-1: Enable of device-initiated U1 failed.
[ 3947.304837] usb 4-1: Enable of device-initiated U2 failed.
[ 3947.308115] scsi host0: uas
[ 3947.312469] scsi 0:0:0:0: Direct-Access NVMe THNSN5256GPUK NV 4103 PQ: 0 ANSI: 6
[ 3947.323391] sd 0:0:0:0: Attached scsi generic sg0 type 0
[ 3947.335215] sd 0:0:0:0: [sda] 500118192 512-byte logical blocks: (256 GB/238 GiB)
[ 3947.337717] sd 0:0:0:0: [sda] Write Protect is off
[ 3947.337722] sd 0:0:0:0: [sda] Mode Sense: 37 00 00 08
[ 3947.342689] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 3947.346873] sd 0:0:0:0: [sda] Optimal transfer size 33553920 bytes
[ 3947.396234] sda: sda1 sda2 sda3
[ 3947.423610] sd 0:0:0:0: [sda] Attached SCSI disk
---- After starting smartctl ----
[ 4150.285057] sd 0:0:0:0: [sda] tag#12 uas_eh_abort_handler 0 uas-tag 1 inflight: CMD IN
[ 4150.285061] sd 0:0:0:0: [sda] tag#12 CDB: opcode=0xe4 (vendor) e4 00 20 02 01 00 00 00 00 00 00 00 00 00 00 00
[ 4150.305057] scsi host0: uas_eh_device_reset_handler start
[ 4155.648989] xhci_hcd 0000:39:00.0: Timeout while waiting for setup device command
[ 4161.028989] xhci_hcd 0000:39:00.0: Timeout while waiting for setup device command
[ 4161.240989] usb 4-1: device not accepting address 2, error -62
[ 4166.656893] xhci_hcd 0000:39:00.0: Timeout while waiting for setup device command
[ 4166.871548] usb 4-1: Device not responding to setup address.
[ 4167.076840] usb 4-1: device not accepting address 2, error -71
[ 4167.287543] usb 4-1: Device not responding to setup address.
[ 4167.499592] usb 4-1: Device not responding to setup address.
[ 4167.704836] usb 4-1: device not accepting address 2, error -71
[ 4167.919551] usb 4-1: Device not responding to setup address.
[ 4168.131644] usb 4-1: Device not responding to setup address.
[ 4168.336814] usb 4-1: device not accepting address 2, error -71
[ 4168.352864] scsi host0: uas_eh_device_reset_handler FAILED err -19
[ 4168.352874] sd 0:0:0:0: Device offlined - not ready after error recovery
[ 4168.352940] usb 4-1: USB disconnect, device number 2
[ 4168.359604] sd 0:0:0:0: [sda] Synchronizing SCSI cache
[ 4168.596857] sd 0:0:0:0: [sda] Synchronize Cache(10) failed: Result: hostbyte=DID_ERROR driverbyte=DRIVER_OK
[ 4168.855549] usb 4-1: Device not responding to setup address.
[ 4169.067527] usb 4-1: Device not responding to setup address.
[ 4169.272797] usb 4-1: device not accepting address 3, error -71
[ 4169.483549] usb 4-1: Device not responding to setup address.
[ 4169.695582] usb 4-1: Device not responding to setup address.
[ 4169.900984] usb 4-1: device not accepting address 4, error -71
[ 4169.909092] usb usb4-port1: attempt power cycle
[ 4170.843586] usb 4-1: Device not responding to setup address.
[ 4171.055785] usb 4-1: Device not responding to setup address.
---- After unplugging device ----
[ 4330.320805] pcieport 0000:00:1c.0: AER: Corrected error received: 0000:00:1c.0
[ 4330.320821] pcieport 0000:00:1c.0: AER: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 4330.320833] pcieport 0000:00:1c.0: AER: device [8086:9d10] error status/mask=00000001/00002000
[ 4330.320840] pcieport 0000:00:1c.0: AER: [ 0] RxErr
[ 4330.370162] xhci_hcd 0000:39:00.0: Refused to change power state, currently in D3
[ 4330.370190] xhci_hcd 0000:39:00.0: BAR 0: error updating (0xd9f00000 != 0xffffffff)
[ 4330.451804] xhci_hcd 0000:39:00.0: Refused to change power state, currently in D3
[ 4330.451825] xhci_hcd 0000:39:00.0: Controller not ready at resume -19
[ 4330.451827] xhci_hcd 0000:39:00.0: PCI post-resume error -19!
[ 4330.451830] xhci_hcd 0000:39:00.0: HC died; cleaning up
[ 4330.464131] xhci_hcd 0000:39:00.0: remove, state 4
[ 4330.464135] usb usb4: USB disconnect, device number 1
[ 4330.464773] xhci_hcd 0000:39:00.0: USB bus 4 deregistered
[ 4330.464780] xhci_hcd 0000:39:00.0: remove, state 4
[ 4330.464782] usb usb3: USB disconnect, device number 1
[ 4330.464970] xhci_hcd 0000:39:00.0: Host halt failed, -19
[ 4330.464973] xhci_hcd 0000:39:00.0: Host not accessible, reset failed.
[ 4330.465189] xhci_hcd 0000:39:00.0: USB bus 3 deregistered
[ 4330.467139] pcieport 0000:02:00.0: Refused to change power state, currently in D3
[ 4330.469839] pci_bus 0000:03: busn_res: [bus 03] is released
[ 4330.469956] pci_bus 0000:04: busn_res: [bus 04-38] is released
[ 4330.470184] pci_bus 0000:39: busn_res: [bus 39] is released
[ 4330.470300] pci_bus 0000:02: busn_res: [bus 02-39] is released