VLIR-HUT Research Project
Ap06\Prj3\Nr04


Project title

NLI4DB: A Natural Language Interface for Querying Database and Automatically Generating Reports

Duration: 2 years (April 1, 2006 - March 31, 2008)

Promoter-spokesperson: Dr. Le Thanh Huong
Co-promoters: Dr. Nguyen Kim Anh, Dr. Vu Tuyet Trinh

Objective

This project aims at developing methods for querying databases using natural language (e.g., "Who is the leader of the class BK20 in the academic year 2004-2005?") and generating summary reports in different formats: tables, text, graphics, or a combination of them. The output of this research is a tool that can integrate with existing Database Management Systems (DBMSs) so as to provide a Natural Language Interface for Querying Database and Automatically Generating Reports (NLI4DB). The result is to improve quality of data processing and to reduce workload for official staffs. An overview of the architecture of the proposed tool is shown in Figure 1.

To achieve these objectives, three major technical problems are needed to solve:

This is a two year project but conceived as the initiate point of a long term program of research. The cooperation with national and international research centers will be continued to optimize research theories and results. The research results will be published in various academic fora such as journal papers, conference's papers and technical reports. This project builds on an existing (informal) international collaboration and will strengthen the cooperation between national scientists with the leading scientists in the area and in the world.

The final (long-term) goal of this project is to establish technologies and to produce a robust framework for the proposed NLI4DB, which can be used in different application areas (e.g., health services, insurance, crime investigation, etc.). Due to the limitation of time, the implementation of this project will focus on a student management database.

Publications

Journals

  1. Le Thanh Huong. An approach in automatically generating discourse structure of text. Journal of Computer Science and Cybernetics, volume 23(3), pp. 212-230, 2007, Vietnam.
  2. Le Thanh Huong. An approach to automatically generate different presentations of natural language paraphrases. Journal of Posts, Telecommunications and Information Technology, volume 18 (3), pp. 74-82, 2007, Vietnam.

Conferences

  1. Anh Nguyen Kim, Huong Thanh Le. Natural Language Interface Construction using Semantic Grammars. The 10th Pacific Rim International Conference on Artificial Intelligence (PRICAI), Dec. 15-19, Hanoi, Vietnam.
  2. Huong LeThanh. A frame-based approach to Text Generation. In Proceedings of the PACLIC21 conference, Nov. 1-3, 2007, Seoul, Korea.
  3. Le Thanh Huong. A Study on Vietnamese Syntactic Parsing. Proceeding of the 4th VAST-AIST workshop on science and technology cooperation. 2007. Vietnam.
  4. Nguyen Quoc The, Le Thanh Huong. Vietnamese syntactic parsing using the Lexicalized Probabilistic Context-free Grammar. FAIR conference, 2007, Nha Trang, Vietnam.
  5. Le Thanh Huong. A study on generating natural language answers from query's result tables. FAIR conference, 2007, Nhatrang, Vietnam.
  6. Vu Tuyet Trinh, Do thi Ngoc Quynh. Improving search result based on context. FAIR conference, 2007, Nhatrang, Vietnam.
  7. Nguyen Kim Anh. Translating the conceptual graph queries into SQL queries. Proceeding of the 20th scientific conference Hanoi University of Technology, pp.111 - 117, 2006, Hanoi, Vietnam.
  8. Nguyen Kim Anh, Pham Thi Thu Hoai. 2006. Imperfect natural language queries to relational databases. Proceeding of the 20th scientific conference Hanoi University of Technology, pp 117 - 122, 2006, Hanoi, Vietnam.
  9. Nguyen Kim Anh. Translating the logical queries into SQL queries in natural language query systems. Proceeding of ICT.rda, 2006. Hanoi, Vietnam.