• Willkommen im Linux Club - dem deutschsprachigen Supportforum für GNU/Linux. Registriere dich kostenlos, um alle Inhalte zu sehen und Fragen zu stellen.

Ata frozen

Ich betreibe die Platte /dev/sdb als Spare Disk in einem Raid Array. Da ich sie schonen will, dachte ich es wäre eine gute Idee sie in Standby zu versetzen
Code:
/sbin/hdparm -y /dev/sdb
Doch ich erhalte nun die folgenden Meldungen im Log
Code:
	Nov 26 09:27:04 tox kernel: ata1.01: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
	Nov 26 09:27:04 tox kernel: ata1.01: cmd b0/da:00:00:4f:c2/00:00:00:00:00/10 tag 0
	Nov 26 09:27:04 tox kernel:          res 40/00:00:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
	Nov 26 09:27:04 tox kernel: ata1.01: status: { DRDY }
	Nov 26 09:27:07 tox kernel: ata1: soft resetting link
	Nov 26 09:27:07 tox kernel: ata1.00: configured for UDMA/100
	Nov 26 09:27:07 tox smartd[12682]: Device: /dev/sdb, not capable of SMART self-check
	Nov 26 09:27:07 tox kernel: ata1.01: configured for UDMA/100
	Nov 26 09:27:07 tox kernel: ata1: EH complete
	Nov 26 09:27:07 tox kernel: sd 0:0:0:0: [sda] 234441648 512-byte hardware sectors (120034 MB)
	Nov 26 09:27:07 tox kernel: sd 0:0:0:0: [sda] Write Protect is off
	Nov 26 09:27:07 tox kernel: sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
	Nov 26 09:27:07 tox kernel: sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
	Nov 26 09:27:07 tox kernel: sd 0:0:1:0: [sdb] 1465149168 512-byte hardware sectors (750156 MB)
	Nov 26 09:27:07 tox kernel: sd 0:0:1:0: [sdb] Write Protect is off
	Nov 26 09:27:07 tox kernel: sd 0:0:1:0: [sdb] Mode Sense: 00 3a 00 00
	Nov 26 09:27:07 tox kernel: sd 0:0:1:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
	Nov 26 09:27:07 tox kernel: sd 0:0:0:0: [sda] 234441648 512-byte hardware sectors (120034 MB)
	Nov 26 09:27:07 tox kernel: sd 0:0:0:0: [sda] Write Protect is off
	Nov 26 09:27:07 tox kernel: sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
	Nov 26 09:27:07 tox kernel: sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
	Nov 26 09:27:07 tox kernel: sd 0:0:1:0: [sdb] 1465149168 512-byte hardware sectors (750156 MB)
	Nov 26 09:27:07 tox kernel: sd 0:0:1:0: [sdb] Write Protect is off
	Nov 26 09:27:07 tox kernel: sd 0:0:1:0: [sdb] Mode Sense: 00 3a 00 00
	Nov 26 09:27:07 tox kernel: sd 0:0:1:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Das Verhalten ist reproduzierbar. Googeln hat mich zur Vermutung gebracht, dass der Wiederanlauf der Platte nicht innerhalb des Timeouts des Treibers libata-scsi erfolgt. Es wird der Fehler erkannt, die IDE-Schnittstelle wird resettet und die beiden Platten daran wieder aktiviert.
Da /dev/sda die Systemplatte ist, stockt ds ganze System kurz. Es zeigen sich aber keine weiteren Auswirkungen.
Wie kann ich den Timeout verhindern? Gibt es Timeout-Parameter? Gibt es einen Patch?

Weitere Infos:
smartctl -a /dev/sdb:
Code:
=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda 7200.10 family
Device Model:     ST3750640A
Serial Number:    5QD56M3X
Firmware Version: 3.AAE
User Capacity:    750,156,374,016 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   7
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Wed Nov 26 15:03:14 2008 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 ( 430) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 202) minutes.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   108   100   006    Pre-fail  Always       -       217163649
  3 Spin_Up_Time            0x0003   092   092   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       32
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   066   060   030    Pre-fail  Always       -       4290378
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       75
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       5
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   060   056   045    Old_age   Always       -       40 (Lifetime Min/Max 29/40)
194 Temperature_Celsius     0x0022   040   044   000    Old_age   Always       -       40 (0 25 0 0)
195 Hardware_ECC_Recovered  0x001a   071   063   000    Old_age   Always       -       241443292
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0000   100   253   000    Old_age   Offline      -       0
202 TA_Increase_Count       0x0032   100   253   000    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%        17         -
# 2  Short offline       Completed without error       00%        15         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
uname -a
Code:
Linux tox 2.6.25.18-0.2-pae #1 SMP 2008-10-21 16:30:26 +0200 i686 i686 i386 GNU/Linux
:lsmod |grep ata
Code:
pata_pdc202xx_old      24576  0
pata_via               27780  5
libata                176604  2 pata_pdc202xx_old,pata_via
scsi_mod              168436  5 st,sr_mod,sg,sd_mod,libata
dock                   27536  1 libata
lspci
Code:
00:00.0 Host bridge: VIA Technologies, Inc. VT8753 [P4X266 AGP] (rev 01)
00:01.0 PCI bridge: VIA Technologies, Inc. VT8633 [Apollo Pro266 AGP]
00:0a.0 Mass storage controller: Promise Technology, Inc. PDC20268 (Ultra100 TX2) (rev 02)
00:0b.0 Mass storage controller: Promise Technology, Inc. 20269 (rev 02)
....
00:10.0 RAID bus controller: Promise Technology, Inc. PDC20265 (FastTrak100 Lite/Ultra100) (rev 02)
00:11.0 ISA bridge: VIA Technologies, Inc. VT8233 PCI to ISA Bridge
00:11.1 IDE interface: VIA Technologies, Inc. VT82C586A/B/VT82C686/A/B/VT823x/A/C PIPC Bus Master IDE (rev 06)
...
:hwinfo --ide
Code:
.....
23: IDE 01.0: 10600 Disk
  [Created at block.234]
  UDI: /org/freedesktop/Hal/devices/storage_serial_SATA_ST3750640A_5QD56M3X
  Unique ID: WZeP.zk_HfbDHSC1
  Parent ID: +Y+h.mcsZ0a690T4
  SysFS ID: /class/block/sdb
  SysFS BusID: 0:0:1:0
  SysFS Device Link: /devices/pci0000:00/0000:00:11.1/host0/target0:0:1/0:0:1:0
  Hardware Class: disk
  Model: "ST3750640A"
  Device: "ST3750640A"
  Revision: "3.AA"
  Serial ID: "5QD56M3X"
  Driver: "pata_via", "sd"
  Driver Modules: "pata_via"
  Device File: /dev/sdb
  Device Files: /dev/sdb, /dev/disk/by-id/scsi-SATA_ST3750640A_5QD56M3X, /dev/disk/by-id/ata-ST3750640A_5QD56M3X, /dev/disk/by-path/pci-0000:00:11.1-scsi-0:0:1:0
  Device Number: block 8:16-8:31
  Geometry (Logical): CHS 91201/255/63
  Size: 1465149168 sectors a 512 bytes
  Config Status: cfg=no, avail=yes, need=no, active=unknown
  Attached to: #7 (IDE interface)

24: IDE 00.0: 10600 Disk
  [Created at block.234]
  UDI: /org/freedesktop/Hal/devices/storage_serial_SATA_ST3120022A_5JT0R49X
  Unique ID: 3OOL.xykse_U1yiA
  Parent ID: +Y+h.mcsZ0a690T4
  SysFS ID: /class/block/sda
  SysFS BusID: 0:0:0:0
  SysFS Device Link: /devices/pci0000:00/0000:00:11.1/host0/target0:0:0/0:0:0:0
  Hardware Class: disk
  Model: "ST3120022A"
  Device: "ST3120022A"
  Revision: "3.06"
  Serial ID: "5JT0R49X"
  Driver: "pata_via", "sd"
  Driver Modules: "pata_via"
  Device File: /dev/sda
  Device Files: /dev/sda, /dev/disk/by-id/scsi-SATA_ST3120022A_5JT0R49X, /dev/disk/by-id/ata-ST3120022A_5JT0R49X, /dev/disk/by-path/pci-0000:00:11.1-scsi-0:0:0:0, /dev/disk/by-id/edd-int13_dev80
  Device Number: block 8:0-8:15
  Geometry (Logical): CHS 14593/255/63
  Size: 234441648 sectors a 512 bytes
  Config Status: cfg=no, avail=yes, need=no, active=unknown
  Attached to: #7 (IDE interface)
 

misiu

Moderator
Teammitglied
Diese Probleme kommen ab und zu von schlechtem SATA-Kabel oder , wenn die Platte nicht auf
SATA 150 runtergejumpert wurde. Libata-Parameter gibt es nicht, da es mittlerweile , wie du
bereits gemerkt hast, automatisch geschieht. Zuerst stop, dann runter auf die kleinere Übertragungsstufe,
wenn es immer noch nicht geht dann wieder stop und nochmal runter.

MfG
misiu
 

misiu

Moderator
Teammitglied
Es gibt einige Kernelboot-Parameter, ob diese helfen, konnte ich nicht testen:
( aus /usr/src/linux/Documentation/kernel-parameters.txt
Kernel-Quellen müssen installiert sein für diese Dokumentation)
libata.dma= [LIBATA] DMA control
libata.dma=0 Disable all PATA and SATA DMA
libata.dma=1 PATA and SATA Disk DMA only
libata.dma=2 ATAPI (CDROM) DMA only
libata.dma=4 Compact Flash DMA only
Combinations also work, so libata.dma=3 enables DMA
for disks and CDROMs, but not CFs.

libata.noacpi [LIBATA] Disables use of ACPI in libata suspend/resume
when set.
Format: <int>

libata.force= [LIBATA] Force configurations. The format is comma
separated list of "[ID:]VAL" where ID is
PORT[:DEVICE]. PORT and DEVICE are decimal numbers
matching port, link or device. Basically, it matches
the ATA ID string printed on console by libata. If
the whole ID part is omitted, the last PORT and DEVICE
values are used. If ID hasn't been specified yet, the
configuration applies to all ports, links and devices.

If only DEVICE is omitted, the parameter applies to
the port and all links and devices behind it. DEVICE
number of 0 either selects the first device or the
first fan-out link behind PMP device. It does not
select the host link. DEVICE number of 15 selects the
host link and device attached to it.

The VAL specifies the configuration to force. As long
as there's no ambiguity shortcut notation is allowed.
For example, both 1.5 and 1.5G would work for 1.5Gbps.
The following configurations can be forced.

* Cable type: 40c, 80c, short40c, unk, ign or sata.
Any ID with matching PORT is used.

* SATA link speed limit: 1.5Gbps or 3.0Gbps.

* Transfer mode: pio[0-7], mwdma[0-4] and udma[0-7].
udma[/][16,25,33,44,66,100,133] notation is also
allowed.

* [no]ncq: Turn on or off NCQ.

If there are multiple matching configurations changing
the same attribute, the last one is used.
 
OP
S

switcher51

Member
Vielen Dank für die Info zu den Kernel-Parametern.
Das Kommando
Code:
modinfo -p libata
hat folgende Information ausgegeben:
Code:
allow_tpm:Permit the use of TPM commands
noacpi:Disables the use of ACPI in probe/suspend/resume when set
ata_probe_timeout:Set ATA probing timeout (seconds)
dma:DMA enable/disable (0x1==ATA, 0x2==ATAPI, 0x4==CF)
ignore_hpa:Ignore HPA limit (0=keep BIOS limits, 1=ignore limits, using full disk)
fua:FUA support (0=off, 1=on)
atapi_passthru16:Enable ATA_16 passthru for ATAPI devices; on by default (0=off, 1=on)
atapi_dmadir:Enable ATAPI DMADIR bridge support (0=off, 1=on)
atapi_enabled:Enable discovery of ATAPI devices (0=off, 1=on)
force:Force ATA configurations including cable type, link speed and transfer mode (see Documentation/kernel-parameters.txt for details)
acpi_gtf_filter:filter mask for ACPI _GTF commands, set to filter out (0x1=set xfermode, 0x2=lock/freeze lock)
Für mein Problem scheint ata_probe_timeout:Set ATA probing timeout (seconds) ein Ansatz zu sein. Liege ich da richtig?
Leider finde ich hierzu keine Beschreibung in kernel-parameters.txt
Wie könnte ich da weiter vorgehen? (Ich bin sehr vorsichtig, da die Systemplatte und das Raid Array daran hängen.)
 
Oben