• No results found

Description of the Oki System as Used for MET-2

N/A
N/A
Protected

Academic year: 2020

Share "Description of the Oki System as Used for MET-2"

Copied!
6
0
0

Loading.... (view fulltext now)

Full text

(1)

Description of the Oki System as Used for MET-2

J. Fukumoto, M. Shimohata, F. Masui, M. Sasaki

Kansai Lab., R&D group

Oki Electric Industry Co., Ltd.

Crystal Tower 1-2-27Shiromi, Chuo-ku,Osaka540-6025 JAPAN

ffukumoto,simohata,masui,[email protected]

INTRODUCTION

Thispap erdescribestheOkiInformationExtractionsystemasusedforMET-2evaluation(Japanese

task) [1][2]. System architectureof Oki MET system is basically thesame as the onefor the NE

system for MUC-7 evaluation although recognition rules of b oth systems are dierent b ecause of

languagedierence(Japanese and English).

OkiMET system [3][4]is basedon term recognition rulesof surface linguisticexpressions and

parse trees which is generated by the parsing mo dule of MT system. The MET system rstly

identies term b oundaries by Japanese character types. Named Entities are recognized using a

Japanese sux list and a wordlist for each NE type. After recognition of surfacepatterns, each

sentence of a text is parsed and NE elements are recognized by structural pattern rules. The

structural patternrecognitionrules aredescrib edinGDLand are executedon theGDLsystem.

SYSTEM DESCRIPTION

OkiMETsystemconsistsofasurfacepatternrecognitionmodule,aSGMLtagpro cessingmo dule,

a structural pattern recognition module and ltering program to convert internal expression of

parsertosurfaceexpression. Architectureof thesystemis showninFigure 1.

In the surface pattern recognition mo dule, the system rstly identies term boundaries by

Japanese character types such as Hiragana, Kanji, Number and so on. Then, NE elements are

recognizedusinga Japanesesuxlistand awordlistforeachNE typ e. ThedetectedNE elements

in a text are SGML-tagged. In the SGML tag pro cessor, these tags and original SGML tags in

a text are embedded in their adjacent words for morphological and syntax analysis. Each

tag-pro cessed sentenceis parsed bythemorphological and syntaxanalyzer which isoriginally usedin

theMTsystem. After parsing,structuralpatternsarerecognizedbystructuralpatternrecognition

rules, although the number of rules is not much in the current implementation. These rules are

describ edinGDLand areexecuted ontheGDLsystemafter parsing. Therecognizedpatternsare

(2)
(3)
(4)
(5)
(6)

[3] Masui, F., Tsunashima, T., Sugio, T., Tazo e, T. and Shiino, T.: \Analysis of Lengthy

Sen-tences Using an English Comparative Structure Mo del", System and Computers in Japan,

pp.40{48, SCRIPTATECHNICA Inc., (1996).

[4] PENS

EE, http://www.oki.co.jp/OKI/RDG/English/kikaku/vol.1/sugio/main.html

References

Related documents