• No results found

Empirical Inspection of IO subsystem for Flash Storage Device at the aspect of discard

N/A
N/A
Protected

Academic year: 2021

Share "Empirical Inspection of IO subsystem for Flash Storage Device at the aspect of discard"

Copied!
5
0
0

Loading.... (view fulltext now)

Full text

(1)

Empirical Inspection of IO subsystem for Flash Storage

Device at the aspect of discard

Seung-Ho Lim and Ki-Jin Kim

Division of Computer and Electronic System Engineering Hankuk University of Foreign Studies

slim@hufs.ac.kr, do131104@naver.com

Abstract. Flash-based storage devices, such as embedded Multi-Media Controller (eMMC) cards, have reached the mainstream market as storage solutions. Although the specification of eMMC has evolved enough to give much high bandwidth to host system, the host system cannot utilize well the distinct features of recent developed eMMC. In this paper, we have analyzed eMMC-based IO subsystem empirically at the aspect of discard command feature, derived several design issues of IO subsystems, By doing this, the required features can be extracted from the analysis, and derive a feawible approach to enhance IO throughput of the Flash-based IO subsystem.

Keywords: Flash Memory, Discard IO subsystem, eMMC, Empirical Analysis

1 Introduction

Flash-based storage devices, such as Solid State Drives (SSDs) and embedded Multi-Media Controller (eMMC) cards, have reached the mainstream market as storage solutions. However, the architecture and operational mechanism of traditional IO subsystem is largely dependent of rotational disk drives, which might not be adequate for the new storage media. Since the inherent physical characteristics of NAND flash memory and internal device architecture of the new storage media, i.e., SSDs or eMMCs are totally different from the inherent characteristics of mechanical characteristics of traditional rotational disk drives, the IO subsystem and device driver should be re-considered for higher device utilization, as well as overall system performance. In addition, recent releases of the SSD or eMMC-related standards [1][2] added distinct features to make use of inherent physical characteristics and enhance IO bandwidth of the products. For eMMC products, these include discard command, context ID, data tag, and packed commands.

However, before using of these new features and applying these to IO subsystem, the empirical analysis should be preceded. In this paper, we analyzed request patterns of IO subsystem with the new storage media, i.e., eMMC storage device. Especially, among the new features, the discard commands [3] and command queuing features are profiled and analyzed at the functional level. Then, based on the profiling and

(2)

analysis, we can derive a design methodology of IO subsystem and configuration parameters with these new features. The methodologies include discard-use methodology.

Table 1. 4 different file system's IO configurations

Conf. 1 Conf. 2 Conf. 3 Conf. 4 Base files 3000 2000 500 200 # transactions 10000(many) 5000(mid) 3000(mid) 1000(small)

File’s size 1KB ~ 10KB(small) 50KB ~100KB(mid) 500KB ~ 1MB(mid) 1MB ~ 2MB(large) User Block sizes Read = 512 bytes, write = 512 bytes

Biases Read / append = 5, create/delete = 5

2 Background and Empirical Analysis

In NAND Flash memory, the ‘Page’ is the unit of read or program. Likewise, bundle of several ‘Page’s called ‘Block’ is the unit of erase. The read and program command are related with data transfer between host and Flash device, while, the erase command has no data transfer between host and Flash device. The erase operation for each cell should be preceded by program operation. The mismatch between program and erase operation makes addition overhead for Flash memory, such as mapping management between logical address and physical address, and GC (Garbage Collection) [4]. The Flash Translation Layer (FTL) [5][6] is in charge of managing these issues. There are two major different points between two storage media of NAND flash-based and rotational-based that this paper has focused on; the one is there is additional internal overhead for Flash device, i.e. GC. The Second difference is that NAND flash-based storage media does not dependent on mechanical aspects any more. Therefore request processing manner should be differed for these two media. For these two issues, the empirical analysis is described at the following subsections one by one.

Inside the Flash device, GC makes available free region for later write requests. During GC operation, valid data should be copied from victim block to available other region. The GC efficiency is getting higher as the number of data to be copied is getting smaller. At the view of Flash memory, some area of Flash memory is considered as valid, even if the area does not contain valid data. FTL provides logical address to host system and hides physical address, whereas, host system has no idea about the real physical location of data. Even the host system thought that a data is considered as invalid, FTL might recognize the data is being valid, which is due to the file system’s metadata operation. For example, when a file is deleted by delete operation from user, file system deletes it by just deleting the metadata of the file, leaving data area of the file alive. In this case, the data is considered as valid within

(3)

area can be deleted physically. The discard command itself is another kind of overhead at the aspect of IO subsystem and device driver although it does not transfer data, since the command also occupies bandwidth. Thus, discard command can be used carefully.

We have analyzed the distribution of discard commands such as discard size, pattern, frequency, and affection on concurrent other requests, by generating file system’s IO operations with create/read/write/delete, as IO request distribution varies. Postmark benchmark [9] is used for 4 different configurations, as described in Table 1. In the workload generation, the file size increases and number of IOs decreases, as number of configuration increases. For each configuration running, we have four types of experiments with regards to discard commands, nodicard, home discard, journal discard, and over40 discard, which means discards are not generated, discards are generated only home region, discards are generated in both of home and journal region, and discards are generated only for the range is over 40 file systems’ blocks, respectively. During the experiments, request size, elapsed time, request sequence, and bandwidth are profiled for discard command.

Fig. 1.Discard Distribution for Configuration 1

Fig. 2. Discard Distribution for Configuration 2

(4)

Fig. 3. 3 Discard Distribution for Configuration 3

Fig. 4. Discard Distribution for Configuration 4

The Figures 1-4 depicts request distribution for generated discard commands for each configuration. For each configuration, the generated discard command and its discard range is plotted. From the figures, we identify that discard range for each discard is getting larger and sporadic as file size increases and number of files decreases. On the contrary, the discards with small range are generated frequently for configuration 1 and 2. We identify that the discards of small range give bad effects for other generic requests, which results in bandwidth decrement, while discards for large range give benefit for Flash device, which results in IO bandwidth increment, from the bandwidth results. The discard for journal region of file system, the discard range makes much more traffic for file system’s journaling operations.

From the profiling, we identify that discard command can have benefit if we use these carefully. For the discard commands that have large range enough to countervail the runtime bandwidth occupation, runtime discard generation is helpful for IO bandwidth, as well as reduces flash internal overhead such as GC. On the country, if the discards are generated with large range enough to give change to make large physical free region inside Flash, it is helpful for runtime operations. For the file system’s aspect, the discard does not disturb busy condition of file systems’ flow sequence. For instance, discards generated during journaling operation, or metadata operations have small range and these are dense in time sequence. In this case,

(5)

3 Conclusion

Flash-based storage devices, such as embedded Multi-Media Controller (eMMC) cards, have reached the mainstream market as storage solutions. Although the specification of eMMC has evolved enough to give much high bandwidth to host system, the host system cannot utilize well the distinct features of recent developed eMMC, which is due to the IO subsystem and device driver. In this paper, we have analyzed empirically the IO subsystem which uses eMMC as storage media, in the aspect of discard command feature, and derived several design issues of Flash-based IO subsystems.

References

1. Intel Corporation, "Intel® High Performance Solid State Drive - Advantages of TRIM". www.intel.com, 2010.

2. JEDEC, “Embedded Multimedia Card Electrical Standard”, Sep. 2013

3. Shu, F. Data set management commands proposal for ata8-acs2. In T13 Technical Committee, United States: At Attachment: e07154r1 (2007).

4. Micron, “Garage Collection in Single-Level Cell NAND Flash Memory”, Technocal Note, TN-2960.

5. Ban, A.: Flash file system, United States Patent, no.5,404,485, 1995.

6. Intel Corporation, “Understanding the flash translation layer(FTL) specification”, http://developer.intel.com/.

7. Chang, Y.-H., Wu, P.-L., Kuo, T.-W., Hung, S.-H., “An adaptive file-system-oriented FTL mechanism for flash-memory storage systems”, ACM Transactions on Embedded Computing Systems, Vol. 11, Issue 1, 2012.

8. Avantika Mathur, M. C., Bhattacharaya, S.: “The new ext4 filesystem: current status and future plans”, In Proceedings of the Linux Symposium (2007), pp. 21–33.

9. Katcher, J., “PostMark: A New File System Benchmark”, Technical Report TR3022, Network Appliance Inc., Oct. 1997.

References

Related documents

The three- phase project brought together a community of government officials, mobile industry professionals, and members of the general public to help design a mobile future for

Lubricate the bore plug seal groove and install the seal in the groove closest to the gripper bore (Figure

While some challenges (such as keeping material current) may be common to most agencies, the importance of these activities, or the risks involved in not managing particular types

The CNN method can produce 3D face images which are then processed using a shape and texture combination to produce a correlation of points on a new face image that has

Just to complicate matters further, a study carried out in the Netherlands (Janneke et al 2009) using data of 11 230 litters in 12 purebred dog breeds, found that litters produced

Even though the number of minority preschool aged males (45) and the number of picture books with male minority lead characters (43) indicated a 1:1 match of books to preschool

circular transition This nozzle directs water towards the rotor through a section which accesses a This nozzle directs water towards the rotor through a section which accesses a

socially charged issues. Researching service user views in forensic mental health: a literature review. How do we recover? An analysis of psychiatric survivor oral