Trademark identification through text recognition

(1)

17

TRADEMARK IDENTIFICATION

THROUGH TEXT RECOGNITION

GHAZALISULONG ZAIDAH IBRAHIM

Faculty Of Computer Science & Information System University Technology Malaysia

Skudai, 80990 Johor Bahru

ABSTRACT

Trademark identification is of great interest in the document domain due to two reasons.

Given a document that consists of a trademark, the trademark needs to be conclude whether

present or not in the database and given a known trademark, we need to index into a database

of documents and extract all information related to it. This paper proposes a new method to

recognise the text on mixed-mode trademarks that consists of various font style and sizes,

focusing on uppercase Roman alphabets. A stroke-based feature extraction method

represented by chain codes is used. A modified rule-based classifier recognises the

characters with satisfactory results.

KEYWORDS : Chain Codes, Strokes, Character Recognition, Trademarks

ABSTRAK

Terdapat dua sebab mengapa topik mengenal pasti logo diminati di bidang dokumen.

Pertama, dengan adanya sebuah dokumen yang mengandungi logo, logo tersebut perlu

dikenal pasti samada terkandung di dalam pangkalan data atau tidak. Kedua, dengan adanya

logo yang telah dikenal pasti, kita perlu mencari di dalam pangkalan data dokumen dan

mencapai semua maklumat yang berkaitan dengan logo tersebut. Kertas keija ini

membincangkan satu kaedah baru untuk mengecam aksara yang beranika fon dan saiz yang

terdapat pada logo, khusus untuk aksara berhuruf besar. Kaedah berdasarkan pada strok

(2)

digunakan sebagai cara untuk pengambilan ciri-ciri menerusi penggunaan kod rantai. Teknik

pengaturan yang telah diubah suai telah digunakan sebagai kaedah untuk mengecam aksara

dan keputusan ujikaji amatlah memberangsangkan.

(3)

19

1.0 INTRODUCTION

Generally, trademarks are characterised as mixed text and graphic symbols and they

are associated with a given group or organisation. They represent a class of images that may

be useful for businessmen, graphic designers or mass communication operators. In the

document domain, trademark identification or recognition tasks are of great interest due to

two reasons. Firstly, given a document that consists of a trademark, the trademark needs to

be concluded whether it is present or not in the database. It is possible that new trademarks

may closely resemble existing ones. Secondly, given a known trademark, we need to index

into a database and extract all information related to the trademark or documents that contain

that trademark. This is an important issue as we may face with the problem of enormous

difficulties in material retrieving and automatic archiving. Text recognition can be one of the

ways for recognising the trademarks and used as index to databases.

There are many commercial devices capable of accurately reading clean machine

printed documents. The first commercial devices were developed in the late 1960s. They

were specialised machines of very high cost designed to handle large amounts of documents.

Matrix matching techniques have been the preferred approach since they are easier to

implement. The unknown image needs to be compared with a set of templates representing

known characters. However, it is not well adapted to size variations and font styles.

[Shlien 1988] developed a feature-based recognition algorithm that uses a decision

tree classifier with 197 binary features which flag the presence or absence of linear, convex,

concave and hole features. [Muelenaere et. al. 1992] developed a character recognition

technique using topological features where the character is described in terms of bars and

(4)

Al. 1992] developed a rule-based classifier which is based on stroke detection.

This paper proposed a character recognition technique that is based on a

combi nittiqa

methods developed by [Muelenaere et. al. 1992] and [Shih et. al. 1992]. This propoM?

technique reduces the number of feature vectors used to only 9.

2.0 FEATURE EXTRACTION

Trademarks consists of a combination of text and graphic symbols. And usually there

is only one graphic symbol and one word of text involved, and no interflow between them.

The work presented here is designed specifically for black (represented by l ’s) and white

(represented by ‘0’) trademarks that are within closed borders. The trademark is digitised by

a scanner and thresholded into binary image. Then, the binary image is smeared horizontally

and vertically to close up any broken borders. The borders are deleted by changing the white

pixels to black. Since the characters are nicely separated, character segmentation is based on

the occurrence of white columns or columns of 0’s. More discussions on character

segmentation can be found in [Lu, Yi 1995].

One of the most important aspects of any character recognition system is feature

extraction. Its purpose is to extract features from the character that would simplify the

recognition process. Basically, a character image consists of several lines or curve segments

(also called as strokes) that are drawn and adjoined at specific position which differentiates a

character from other characters [Shirvaikar et. al. 1988].

The feature extraction method proposed here is divided into two levels of analysis

where in the first level, a global analysis is performed and if ambiguity occurs, the second

level is invoked where a roagnified analysis is performed. In global analysis, the strokes are

(5)

analysed generally for its straightness where a slight curve or slant is ignored. But, in

magnify analysis, the details of the strokes are analysed where a slight curve or slant is

detected in determining the results.

In [Muelenaere et. Al. 1992], the binary image is scanned in 4 directions which is, left

to right, right to left, top to bottom and bottom to top to obtain the character projections,

outlines and stroke densities. Calculations are performed to produce the number of bars and

holes that present in the image. The image in this research is also scanned in 4 directions to

obtain 4 strokes, referred as Left, Right, Top and Bottom, represented as chain codes [Wilf

1981].

There are 8 possible directions between a point and its neighbour. Figure 1(a) shows

the 8 direction of chain codes and figure 1(b) shows an example of an image and its chain

codes

2

21

6

Fig. 1 (a) 8 direction of chain codes

(6)

Left chain codes

65656656566565665656656565665656656566565665655654544 Right chain codes

6766767676676767667676676767667676766767676767770706 Top chain codes

00

Bottom chain codes

0000000000001344322222121221211000000000000000000776766767666655445700000000000000000

Fig. 1(b) A sample of character A and its four strokes represented as chain codes.

2.1 STRAIGHT LINE DETECTION

Various algorithms have been developed to determine the straightness of chain codes

[Freeman 74, We 82, Hung 85]. Yuan et. al. [Yuan et. al. 1995] have developed an algorithm

to detect straight lines based on the quantization scheme. In this scheme, a passing area is

constructed around pixels so that a line formed by these pixels must pass through this area if

it is straight. The passing area is modified as a new point is accepted until a point is rejected

as a pixel on the straight line.

However, before the straight line detection process can be applied, the chain codes

needs to be cleaned or smoothed to delete any noises that may occur [Wilf 1981]. Smoothing

can be time consuming. Thus, to overcome this problem, the straight line detection

algorithm can be modified to cater this noise where a small inclination to the next direction

other than specified by Yuan would also be considered as part of the same chain code of a

straight line. This can be performed by using certain rules where the codes of the same

(7)

straight line are grouped together. For instance, a chain codes that consists of values 1, 0 or 2

belong to the same group, values 5, 6 or 7 belong to another group, and so on. By applying

this rule, the smoothing process can be discarded.

2.2 RULE-BASED CLASSIFIER

[Shih et. Al. 1992] developed a rule-based classifier where the character image is

compared with a character recogniser. Each character recogniser consists of a group of

characters that have similar topological structures. Once a rule is matched, the input

character is recognised. Otherwise, the next character recogniser will be tested.

This paper proposes a modified rule-based classifier where the differences in

characters among the different fonts are detected and deleted. For instance, the bottom of the

left stroke of font Times New Roman and Courier consists of a curve to the left. Thus, the

curve is detected and then ignored before straight line detection is performed.

The next step is to check whether the input image satisfies the common properties of

the group where the characters are divided into two groups based on the hole feature as

follows:

Group 1 - A, B, P, R, Q, D, O

Group 2 - C, E, F, G, H, I, J, K, L, M, N, S, T, U, V, W, X, Y, Z

The last step is to match the input image against each recognizer in the group. For

global analysis, each stroke is tested for the number of straight lines. A line with small

curvature or a perpendicular line is also considered as a straight line. For Group 1, ambiguity

occurs for characters D and O, and to differentiate between them, the left stroke is analysed

further under the magnify analysis level. Its top left stroke is tested for the occurrence of a

curve as character O has that feature but not character D.

23

(8)

For Group 2, two cases of ambiguity occurs and they are further divided into two

subgroups, that are, Z and T, and U, V and Y. Under the magnify analysis level, the left

chain codes for characters Z and T are looked at. Character T has a vertical line but not

character Z. The depth of the top stroke of character U and V is always deeper than the depth

of the top stroke of character Y. And to differentiate between characters U and V, a vertical

line is detected on the top stroke.

3.0 EXPERIMENTAL RESULTS

Trademarks that consists of machine printed character sets of the Arial, Times New

Roman and Courier fonts served as test sets for the experiments. 20 different trademarks

have been tested with 122 characters from the three different fonts. The recognition rate

obtained was 96%. Errors occur in recognising a few samples of characters G and S. This is

because the difference occur in the middle of the stroke and the algorithm could not detect it.

See fig. 3 for some samples of trademarks where the text have been successfully recognised.

Character G in this sample is recognisable by the algorithm.

4.0 CONCLUSION

The proposed algorithm for character recognition described in this paper has produce

satisfactory results. This technique able to cater only 9 feature vectors, that are, horizontal

and vertical lines on all four strokes, plus the hole feature. Reducing the number of features

greatly reduces the memory size needed by the software. Nonetheless, more research is

being done to make this algorithm more flexible and improve the recognition rate.

(9)

Fig. 3 Some samples of trademarks whose text have been correctly recognised.

(10)

5.0 REFERENCES

1. Bai, G. “Multifont Chinese Character Recognition Using Side-Stroke-End-Feature”, Proc. IEEE, 1993, pp. 794-797.

2. Hung, S.H. Y. “On The straightness Of Digital Arcs”, IEEE Trans. PAMI, vol. PAMI-7, No. 2, March 1985, pp. 203-215.

3. Freeman, Herbert. “Computer Processing Of Line Drawing Images”, Computer Surveys, Vol. 6, No. 1, March 1974, pp. 57-97.

4. Muelenaere, P. De; Dauw, M.; and Legat, J.D. “Omnifont Recognition Of Text Using Topological Recognition Techniques”, Proc. IEEE, 1992, pp. 410-413.

5. Shih, F.; Chen, S.; Hung, D.; and Ng. P. “A Document Segmentation, Classification And Recognition system”, Proc. IEEE, 1992, pp. 258-267.

6. Shlien, S. “Multifont Character Recognition For TypeSet Documents”, Int’l. Journal Of Pattern Recognition & Artificial Intelligence, Vol. 2, No. 1, March 1988, pp. 603-620. 7. Shirvaikar, M.V. and Musavi, M.T. “A Stroke Feature Distribution Scheme For The

Recognition Of Alphanumeric Characters”, Proc. IEEE, 1988, pp. 192-195.

8. We, Le-De, “On The Chain Code Of A Line”, IEEE Trans. PAMI, vol. PAMI-4, No. 3, May 1982, pp. 347-353.

9. Wilf, J.M. “Chain Codes”, Robotics Age, Vol. 3, No. 2, Mar/Apr 1981, pp. 12-19.

10. Yuan, J. and Suen, C. “An Optimal O(n) Algorithm For Identifying Line Segments From A Sequence Of Chain Codes”, Pattern Recognition, Vol. 28, No. 5,1995, pp. 635-646.