SQFlash Intelligent SSD Self-Management

3y ago
38 Views
4 Downloads
1.07 MB
11 Pages
Last View : 11d ago
Last Download : 3m ago
Upload by : Ciara Libby
Transcription

SQFlash Intelligent SSD Self-ManagementAuthor: Ares.ChengKeywords: SATA, SSD, S.M.A.R.T.To meet the increasing demand for higher data volumes, the unit storage density ofNAND flash has been continuously improved through substantial advancements inprocess technology that range from SLC to 3D NAND TLC. Yet, the downside ofthis improvement is reduced P/E cycles for high-density NAND flash and increasedrisk of data corruption. To facilitate monitoring of storage device status, the JEDECAssociation established relevant Self-Monitoring, Analysis, and ReportingTechnology (S.M.A.R.T.) specifications for SATA storage devices. However,because such instructions typically necessitate system technology-relatedknowledge, most users experience difficulty obtaining and utilizing such information,as well as performing adequate risk assessments and remote monitoring inaccordance with S.M.A.R.T. instructions.To reduce the risk of data corruption/loss and ensure data access, Advantechdeveloped its SQFlash Utility and WISE-PaaS/DeviceOn platform to help customersimplement S.M.A.R.T. specifications, predict product lifespans, and remotelymonitor system status in real-time using a simple graphical user interface. Thispaper provides a detailed description of the intelligent SSD self-managementsystem developed based on Advantech SQFlash.0www.advantech.com

Table of ContentIntelligent SSD Self-Management . 2SSD Information and Endurance Check . 2SSD Lifespan Prediction . 5General Prediction . 5Endurance Estimation . 6Remote Management . 7Predictive Maintenance Quality . 9Conclusion . 101www.advantech.com

Intelligent SSD Self-ManagementSSD Information and Endurance CheckS.M.A.R.T. information in the SATA specification contains up to 256information items. Some information is specific to traditional HDDs orSSD-related information, while some is considered manufacturer-defined orundefined information. To help users quickly understand the SSD statusand information, Advantechโ€™s SQFlash Utility provides data for the 10 keyparameters listed in the Information column, as shown in Figure 1. The datais obtained by decoding and calculating S.M.A.R.T. attributes. Theremaining SSD endurance is presented in percentage using a pie chart.With this information, users can quickly understand the SSD status andestablish contingency measures accordingly.Figure 1. SSD Information and Endurance CheckAdditionally, the provision of easy access to S.M.A.R.T data via theSQFlash Utility allows users to obtain more advanced information regardingthe SSD status. As shown in Figure 2, many self-defined items are includedin the SQFlash S.M.A.R.T. data, such as 10h, 11h, 64h, A8h, AAh, ADh,AFh, DAh, E7h, EAh, and EBh.2www.advantech.com

Figure 2. SQFlash S.M.A.R.T. DataDATA ADDRESS (Byte)IDATTRIBUTE NAME10901h Raw Read Error Rate8765Uncorrectable ECC Count09h Power On HoursPower On Hours0Ch Power Cycle CountPower On/Off Counts0Eh Device Capacity00Device Capacity0Fh User Capacity00User Capacity00Total Available Spare Block11h Spare Blocks Remaining00Remaining Spare Block64h Total Erase Count00Total Erase Count10hInitial Spare BlocksAvailableA8h SATA PHY Error CountSATA PHY Error CountAAh Bad Block CountLater Bad00Early BadADh Erase Count00Avg. EraseMax Erase00AEhAFhUnexpected Power LossCountPower Failure ProtectionStatusUnexpected Power Loss CountVoltage Stabilizer Trigger Count3www.advantech.comGuaranteedDrive StatusFlush(0x00 Normal)

DATA ADDRESS (Byte)IDATTRIBUTE NAME1098765(0x01 Enable)C0hUnexpected Power LossCountC2h TemperatureCAh00Max. Temp.Percentage of SparesRemaining00000CountCurrent Temp.SSD Life0UsedCRC Error CountE7h SSD Life RemainingEBh Total NAND WrittenUnexpected Power LossMin. Temp.DAh CRC ErrorEAh Total NAND Read000000SSD Life LeftTotal NAND Read (Sector, 512B)Total NAND Written (Sector, 512B)F1h Total Host WriteHost Write (Sector, 512B)F2h Total Host ReadHost Read (Sector, 512B)Table 1. S.M.A.R.T. ID DefinitionsIDDescription of the SQFlash-Defined S.M.A.R.T. IDs10hInitial Spare Blocks Available: The spare block count when the SSD is newly made.11hSpare Blocks Remaining: Currently available spare blocks.64hTotal Erase Count: Sum of the erase count from all blocks.A8hSATA PHY Error Count: The PHY error count - such as data FIS CRC, code error,disparity error, command FIS CRC . etc. This value resets to zero when powered off.Bad Block Count: Block is a capacity unit of NAND flash. Bad blocks are damagedblocks that the SSD controller has marked as unusable.There are two types of bad blocks โ€” early bad blocks and later bad blocks.AAhEarly bad blocks are the result of a manufacturing fault.Later bad blocks result from improper usage. If a block has too many ECCs, the SSDcontroller will mark the block as a later bad block. The erase count for the block will needto be checked. For SSDs nearing the end of their lifespan, later bad blocks are normal.Erase Count: This value is directly related to the SSD lifespan.There are two types of erase counts โ€” the average erase count and max. erase count.The average erase count is the average of the erase count for all blocks.ADhThe max. erase count is the highest erase count of all blocks.The SSD endurance is dependent on the NAND flash type, as shown in Table 2.4www.advantech.com

IDDescription of the SQFlash-Defined S.M.A.R.T. IDsNAND Flash TypeP/E Cycle (times)SLC100,000UltraMLC30,000MLC3,0003D TLC (BiCS 3)3,000Table 2. P/E Cycle Times According to NAND TypePower Failure Protection Status: There are three power failure protection statuses.Voltage Stabilizer Trigger Count: The number of times the voltage stabilizer circuit hasAFhbeen triggered.Guaranteed Flush: This indicates the function is on/off status (0x01 Enable).Drive Status: Error code of power failure protection functions (0x00 Normal).CRC Error: Data error coding between the controller and host. If the CRC error count isDAhnot 0, the SATA signal is poor. Users should conduct a SATA signal trace from the host tothe device (PCB layout, SATA cable, and SATA connector).SSD Life Remaining, SSD Life Left: Presented as a percentage and calculated based onE7hthe average erase count and NAND reference erase count.๐‘จ๐’—๐’†๐’“๐’‚๐’ˆ๐’† ๐‘ฌ๐’“๐’‚๐’”๐’† ๐‘ช๐’๐’–๐’๐’•๐’๐’๐ƒ ๐‹๐ข๐Ÿ๐ž ๐‹๐ž๐Ÿ๐ญ ๐Ÿ ()%๐‘ต๐‘จ๐‘ต๐‘ซ ๐’“๐’†๐’‡๐’†๐’“๐’†๐’๐’„๐’† ๐’†๐’“๐’‚๐’”๐’† ๐’„๐’๐’–๐’๐’•EAhEBhTotal NAND Read: The total data size that the SSD controller reads from NAND flash.Sector unit size is 512 bytes.Total NAND Written: The total data size that the SSD controller writes to NAND flash.Sector unit size is 512 bytes.Table 3. SQFlash-Defined S.M.A.R.T. IDsSSD Lifespan PredictionAlthough the Endurance Check pie chart can be referenced to determinethe remaining product lifespan, the pie chart cannot predict the exactend-of-life (EOL). To help users predict the SSD lifespan both generally andunder specific operating conditions, Advantechโ€™s SQFlash Utility softwarefeatures General Prediction and Endurance Estimation functions.General PredictionThe General Prediction function provides a general EOL prediction for theSSD. This prediction is based on the remaining SSD lifespan andcalculated using the following formula:5www.advantech.com

๐„๐ฌ๐ญ๐ข๐ฆ๐š๐ญ๐ž๐ ๐ฎ๐ฌ๐š๐›๐ฅ๐ž ๐ญ๐ข๐ฆ๐ž (๐‘ญ๐’๐’‚๐’”๐’‰ ๐‘ท/๐‘ฌ ๐‘ช๐’š๐’„๐’๐’† ๐‘จ๐’—๐’†๐’“๐’‚๐’ˆ๐’† ๐‘ฌ๐’“๐’‚๐’”๐’† ๐‘ช๐’š๐’„๐’๐’†) ๐‘บ๐’‚๐’‡๐’†๐’•๐’š ๏ฟฝ๏ฟฝ ๐‘ฌ๐’“๐’‚๐’”๐’† ๐‘ท๐’๐’˜๐’†๐’“ ๐’๐’ ๐’‰๐’๐’–๐’“๐’”The predicted EOL provided by the General Prediction function ispresented as a specific date and time, as shown in Figure 3. This date isdetermined based on the estimated usable time and the current date andtime. The Safety Factor acts as a buffer that allows users to prepare forSSD replacement. The Safety Factor value can be set as 0.8.Figure 3. SSD Estimated LifespanEndurance EstimationThe Endurance Estimation function provides an exact EOL prediction forthe SSD under specific operating scenarios. This function can assist userswith risk assessments and spare parts inventory management. Thepredicted EOL is determined by using the โ€œStartโ€ and โ€œStopโ€ buttons tomeasure the average erase cycle generated by writing data within aspecified period of time. The Write Amplification and Terabyte Written(TBW) factors for the specific operating scenario can be calculated usingthe following formula:๐’๐’๐ƒ ๐‹๐ข๐Ÿ๐ž๐ญ๐ข๐ฆ๐ž ๐„๐ฌ๐ญ๐ข๐ฆ๐š๐ญ๐ข๐จ๐ง ๐–๐ซ๐ข๐ญ๐ž ๏ฟฝ๏ฟฝ ๐“๐๐– ๐‘ญ๐’๐’‚๐’”๐’‰ ๐‘ด๐’‚๐’™๐’Š๐’Ž๐’–๐’Ž ๐‘ท/๐‘ฌ ๐‘ช๐’š๐’„๐’๐’† ๐‘ป๐’†๐’”๐’• ๐’•๐’Š๐’Ž๐’†๐‘จ๐’—๐’†๐’“๐’‚๐’ˆ๐’† ๐‘ฌ๐’“๐’‚๐’”๐’† ๏ฟฝ๏ฟฝ ๐‘ฌ๐’“๐’‚๐’”๐’† ๐‘ช๐’š๐’„๐’๐’†๐’” ๐‘บ๐‘บ๐‘ซ ๐‘ช๐’‚๐’‘๐’‚๐’„๐’Š๐’•๐’š๐‘ฏ๐’๐’”๐’• ๐‘พ๐’“๐’Š๐’•๐’†๐‘บ๐‘บ๐‘ซ ๐‘ช๐’‚๐’‘๐’‚๐’„๐’Š๐’•๐’š ๐‘ญ๐’๐’‚๐’”๐’‰ ๐‘ท/๐‘ฌ ๐‘ช๐’š๐’„๐’๐’†๐‘พ๐’“๐’Š๐’•๐’† ๏ฟฝ๏ฟฝ*Average Erase Cycles: The average erase cycles added since the Start button was last pressed.*Host Write: The total amount of data written by the host to the SSD since the Start button was last pressed.When the โ€œStartโ€ button is pressed, data for the specific operating scenariois collected in real time, as shown in Figure 4. When the task or operationis completed, press the โ€œStopโ€ button to obtain the results. A pop-upwindow displaying the text results will then appear, as shown in Figure 5.6www.advantech.com

Figure 4. Endurance Estimation Real-Time DataFigure 5. Test Results for a Specific Operating ScenarioRemote ManagementFor SSD status and lifespan monitoring, industrial application users mayhave one-to-many or many-to-one remote monitoring requirements. Inorder to meet diverse needs, Advantechโ€™s WISE-PaaS/DeviceOn remotemanagement and monitoring platform was equipped with a Remote SSDMonitoring function. The WISE-PaaS/DeviceOn platform comprises twomajor components โ€” a server-side system (the device manager) and adevice management agent. The server-side system communicates withdevices by using various protocols in order to provide both individual andbulk device control. Additionally, the server-side system facilitates remotemanagement of the software and applications deployed on devices, andcan be used to monitor/reset device applications or processes if necessary.7www.advantech.com

Figure 6. SQFlash Cloud-Based Management DiagramUsers can remotely monitor the SSDโ€™s current operating status andremaining lifespan in real time from a PC or cell phone via theWISE-PaaS/DeviceOn web page. The WISE-PaaS/DeviceOn web pageprovides critical information, such as the Endurance Check results, ECCcount and good Block Rate, as shown in Figure 7.Figure 7. WISE-PaaS/DeviceOn Operation Interface8www.advantech.com

To allow users to understand the SSD status at a glance, theWISE-PaaS/DeviceOn platform also features a built-in dashboard. Userscan customize the dashboard layout to emphasize or highlight specific data,as shown in Figure 8.Figure 8. WISE-PaaS/DeviceOn SQFlash DashboardPredictive Maintenance QualityTo provide users with another means for quickly understanding the SSDstatus, the WISE-PaaS/DeviceOn platform is also equipped with aPredictive Maintenance Quality (PMQ) function. The PMQ value iscalculated based on the predicted EOL provided by the General Predictionfunction (see Section 2.2.1), with historical RMA and DOA failure modeanalysis used as the auxiliary criterion. The RMA and DOA failure modeanalysis procedures are regularly updated to optimize machine learningalgorithms. The PMQ user interface shows the current health status andlatest event notifications. For events that require special attention, the PMQinterface will display a warning message or issue a user notification, asshown in Figure 9.9www.advantech.com

Figure 9. PMQ User InterfaceConclusionThe primary goal of Advantechโ€™s SQFlash development team is to assistSSD users with reducing data losses and securing assets. Advantechโ€™sSQFlash Utility provides an intuitive and convenient tool for obtaining theSSD health status. Using the SQFlash Utilityโ€™s SSD Life Prediction function,users can also optimize risk assessments and spare parts chโ€™sWISE-PaaS/DeviceOn platform can be used to conduct remote monitoringand management of SSDs in real time. Finally, Advantechโ€™s SQFlash Utilityalso supports customization to enable the achievement of intelligent SSDself-management according to specific usage requirements.Download SQFlash UtilityFor more information about industrial storage modules SQFlash, please om

monitor system status in real-time using a simple graphical user interface. This paper provides a detailed description of the intelligent SSD self-management system developed based on Advantech SQFlash. SQFlash Intelligent SSD Self-Management Author: Ares.Cheng Keywords: SATA, SSD, S.M.A.R.T.

Related Documents:

875319-b21 hpe 480gb sata ri m.2 2280 ds ssd 875587-b21 hpe 480gb nvme x4 ri sff scn ds ssd 875589-b21 hpe 960gb nvme x4 ri sff scn ds ssd 875591-b21 hpe 1.92tb nvme x4 ri sff scn ds ssd 875593-b21 hpe 400gb nvme x4 mu sff scn ds ssd 875595-b21 hpe 800gb nvme x4 mu sff scn ds ssd

SUSI-SQFlash Userโ€™s Manual Version 1.0.1802.1 Advantech Co. Ltd. No. 1, Alley 20, Lane 26, Rueiguang Road, Neihu District, Taipei 114, Taiwan, R. O. C.

4. effective 1 apr 13, structured self development 1 (ssd-1) is a prerequisite to attend the warrior leader course (wlc). effective 1 jun 13, structured self development 3 (ssd-3) is a prerequisite to attend the senior leader course (slc). effective 1 jun 13, structured self development 4 (ssd-4) is a prerequisite to

inch rack. This small-footprint all-flash model contains a 240-GB M.2 form-factor SSD that acts as the boot drive; a 240-GB housekeeping SSD; a 375-GB Optane NVMe SSD, 1.6-TB NVMe SSD, or 400-GB SAS SSD write-log drive; and six to eight 960-GB or 3.8-TB SATA SSDs for storage capacity.

64 bits aggregates Aggregate with snapshots, they must be deleted before converting into hybrid aggregate SSD rules: minimum number and extensions depending on the model e.g. FAS6000 9 2, 6 (with 100GB SSD) No mixed type of disks in a hybrid aggregate: just SAS SSD, FC SSD, SATA SSD. No mixed type of disks in a raid_gp.

SSD IV (STAFF SERGEANT (P) โ€“ MASTER SERGEANT / FIRST SERGEANT) SSD IV tasks are primarily focused at the battalion level. It is highly recommended that SSD IV be completed prior to assuming duties as a First Sergeant 4 Modules; 13 Lessons Effective 1 FEB 13, Staff Sergeants(P), Sergeants First Class, and Master Sergeants will be

THE DREAM DATA WAREHOUSE (CIRCA 2012) No management tasks, offered as a service Fast out-of-box with no tuning knobs Structured and semi-structured Petabyte scale . resume when needed SSD/RAM Cache SSD/RAM Cache SSD/RAM Cache SSD/RAM Cache Virtual warehouse A Virtual warehouse B Virtual warehouse C Virtua

not know; am I my brotherโ€™s keeper?โ€™ (Genesis 4:9) N NOVEMBER 2014 the Obama administration in the United States announced an extension of relief for immigrant families, prompting one cartoonist to caricature โ€˜an immigrant family climbing through a window to crash a white familyโ€™s Thanksgiving dinnerโ€™ with the โ€˜white father unhappily telling his family, โ€œThanks to the president .