Research >> The ILIT R&D Methodology

R&D can be method-oriented or task-oriented. Many R&D groups in academia tend to concentrate on particular methods and techniques – algorithms, logics, formal languages, etc. – and then seek applications for evaluating and testing these methods. Task-oriented research starts with a real-world problem and then devise methods to best tackle it. (This distinction is, to a degree, an abstraction; in reality, many more factors play a role in the choice of research paradigm.) Both basic and applied research at ILIT is guided by our belief that the real breakthrough in language technologies will take place only if the computer systems will gain the capability to understand texts, not just to manipulate and compare uninterpreted or underinterpreted symbol strings on the basis of their occurrence in texts.

To attain this capability, computer systems must become self-aware and aware of the world that surrounds them. They must internalize descriptions of kinds of events and objects (both physical and mental) that exist in the world; they must be able to model their own goals and plans; they must also be aware of other intelligent agents and their goals, plans and knowledge. Finally, in order to model intelligent communication with humans, the systems must internalize the knowledge about one or more human languages, including the connections between elements of language and elements of their internal world model.

In view of the above, the main line of ILIT’s basic research is devoted to the many and various facets of extracting and manipulating meaning in natural languages. Since our goal is to build comprehensive working systems (rather than develop limited-purview theories), we cannot work on semantics in isolation; determination and manipulation of meaning relies on computational descriptions of several non-semantic strata of language: ecology, lexis, morphology and syntax – in addition to semantics proper. Our approach is strictly computational – we are developing algorithms and building systems for text analysis that result in meaning specification as well as for text synthesis on the basis of its meaning. Our approach integrates rule-based and corpus-based methods, in that the conditions in our processing rules may include both linguistically and ontologically motivated constraints and results of frequency and co-occurrence analysis of text elements in corpora.

Our basic scientific paradigm, incorporating all of the above facets, is embodied in the theory of ontological semantics. This theory is realized as a society of “microtheories” of particular language and world phenomena. This arrangement helps to organize the vast descriptive and computational enterprise. It also makes it possible to “import” descriptions available for particular phenomena and then adapt them for our needs. At the center of our current basic research are a number of lexical-semantic microtheories as well as the study of reference.

Adding treatment of meaning in various languages to the battery of existing natural language processing technologies is the mainstay of ILIT’s applications research.

Development of tools and knowledge resources to support our basic and applied research has been an important component of our work.

ILIT University of Maryland Baltimore County ECS 202 1000 Hilltop Circle Baltimore, MD 21250
Phone: 410-455-8480 Fax: 410-455-8488 E-mail: ILIT@UMBC.EDU