Huong Le Thanh, Ph.D.
Associate Professor
Department of Information Systems,
School of Information and Communication Technology
Hanoi University of Science and Technology, Vietnam
Tel: +84 (0)4 38696124
Email: huonglt@soict.hust.edu.vn,
huongthanh@gmail.com
Research Interests
- Computational Linguistics: syntax, semantics, text and discourse theories, question answering,
natural language generation, summarization, machine translation, information extraction
- Applications of NLP technologies to other domains
- Data Mining and Knowledge Discovery
- Expert Systems and Knowledge Acquisition
Projects
2016-2017. Document storage and duplicate detection system. (Chief Investigator). Funded
by Hanoi University of Science and Technology.
2011-2013. Investigate some methods for automatically summarizing text, applying
for Vietnamese language. (Chief Investigator). Funded by the Ministry of Education
and Training.
2011-2013. Improving Genetic Programming Learning with Applications in Natural
Language Processing. Funded by NAFOSTED.
2011-2013. Research and develop techniques for implementing a domain-dependent
search engine. A government protocol project.
2009-2011. Logical approaches in information representation and processing.
Funded by NAFOSTED.
2009-2010. Implementing a system to gather and extract information from the Internet.
(Chief Investigator). Funded by the Ministry of Education and Training.
2009 - 2011. Research and implement intelligent robots, applying in exploiting
multimedia information. A government project.
2007-2009. Research and develop
an open source Vietnamese syntactic parser for public usage - a branch project
of the government project "Construct typical and essential products on Vietnamese
text and speech processing".
2006 - 2008. NLI4DB - A Natural Language Interface for Querying
Database and Automatically Generating Reports (Chief Investigator). Funded
by the Flemish Interuniversity Council for University Development Cooperation
(VLIR UOS).
2006-2007. Develop a system using natural language to query database and automatically
generate reports. Funded by the Department of Science and Technology, Hanoi.
Open corpus from our projects
- Vietnamese text summarization corpus textsum_LTH (2014), from the project
"Investigate some methods for automatically summarizing text, applying for
Vietnamese language", funded by the Ministry of Education and Training.
Demo
Publications
Journals
- Luu Minh Tuan, Le Thanh Huong, Hoang Minh Tan (2021). An
effective method combining deep learning models and reinforcement learning techniques
for extractive text summarization (in Vietnamese). Tạp chí Khoa học và Công nghệ Đại
học Thái Nguyên, Vol. 226, No. 11 (2021), pp. 208-215.
- Tuan Luu Minh, Huong Le Thanh, Tan Hoang Minh (2021). A
hybrid model using the pre-trained BERT and deep neural networks with rich
feature for extractive text summarization. Journal of Computer Science and
Cybernetics. Volume 37, No. 2, pp. 123-143.
- Nguyen Van Son, Le Thanh Huong, Nguyen Chi Thanh (2021). A
two-phase plagiarism detection system based on multi-layer LSTM Networks.
IAES International Journal of Artificial Intelligence.
- Minh-Tuan Luu, Thanh-Huong Le, Minh-Tan Hoang (2021). An
effective deep learning approach for extractive text summarization. Indian
Journal of Computer Science and Engineering. Vol. 12, No. 2, 2021, e-ISSN:0976-5166,
p-ISSN:2231-3850, pp.434-444.
- Nguyen Van Son, Le Thanh Huong, Nguyen Chi Thanh (2018). A
keyword extraction method for finding candidate sets in the plagiarism detection
problem (in Vietnamese). Tạp chí Nghiên cứu KH&CN quân sự, Special Issue on IT,
pp. 27-35, Nov. 2018.
- Huong Thanh Le, Nhan Trong Tran (2018). A
Sentiment Analyzer for Informal Text in Social Media. Journal of Science
& Technology 131 (2018), pp. 6-12.
- Nguyen Van Son, Le Thanh Huong, Nguyen Chi Thanh (2017). Automatic
keyword extraction using artificial neuron network and feature extraction.
Journal of Military Science and Technology, Special Issue, No.48A.
- Pham Minh Chuan, Le Hoang Son, Mumtaz Ali, Tran Dinh Khang, Le Thanh Huong,
Nilanjan Dey (2017). Link Prediction in Co-authorship Networks based on Hybrid
Content Similarity Metric. Applied Intelligence, ISSN: 0924-669X. doi: 10.1007/s10489-017-1086-x.
(SCI, 2016 IF = 1.904, Springer)
- Phạm Minh Chuẩn, Trần Đình Khang, Lê Thanh Hương, Trần Mạnh Tuấn, Lê Hoàng
Sơn (2017). Co-authorship link prediction using fuzzy semi-supervised clustering
(in Vietnamese). Chuyên san Khoa học Tự nhiên – Kỹ thuật – Công nghệ (Đại học Thái Nguyên),
Vol. 173, No. 13, ISSN 1859-2171, pp. 45-50.
- Phạm Minh Chuẩn, Lê Hoàng Sơn, Trần Đình Khang, Lê Thanh Hương (2017). Proposing
a new collaboration recommendation model for co-authorship networks based on
collaboration and correlation indices (in Vietnamese). Tạp chí Khoa học và Công nghệ
Việt Nam, Vol. 22, No. 11, Nov. 2017, ISSN 1859-4794, pp. 9-14.
- Le Thanh Huong, Sam Chanrathany,
Nguyen Thanh Thuy, Nguyen Thanh Long, Trinh Minh Dung (2014). Relation extraction
in Vietnamese text using label propagation. Journal of Informatics and Cybernetics,
Volume 30, No.1, pp. 15-27
- Sam Chanrathany, Le Thanh
Huong, Nguyen Thanh Thuy, Nguyen Huu Thien (2012). Automatic information extraction
in Vietnamese text. Journal of Informatics and Cybernetics.
- Phạm Minh Chuẩn, Lê Thanh Hương, Trần Đình Khang, Trần Ngọc Cương (2011).
An article recommendation system (in Vietnamese). Tạp chí
nghiên cứu KH&CN quân sự.
- Le Thanh Huong. 2007.
An approach in automatically generating discourse structure
of text. Journal of Computer Science and Cybernetics, Vietnam. Volume
23(3), pp. 212-230.
- Le Thanh Huong. 2007.
An approach to automatically generate different presentations
of natural language paraphrases. Journal of Posts, Telecommunications
and Information Technology. Volume 18 (3), pp. 74-82.
- Le Thanh Huong, Pham Hong
Quang, and Nguyen Thanh Thuy. 2000. An approach to automatically analyze syntax of Vietnamese text.
Journal of Informatics and Cybernetics, Volume 15, No.4.
Conferences/Workshops
- Ha Nguyen Tien, Dat Nguyen Huu, Huong Le Thanh, Vinh Nguyen Van and Minh
Nguyen Quang (2021). KC4Align: Improving Sentence Alignment method for Low-resource
Language Pairs. PACLIC 2021.
- Huong T. Le, Dung T. Cao, Trung H. Bui, Long T. Luong and Huy Q. Nguyen
(2021). Improve Quora Question Pair Dataset
for Question Similarity Task. RIVF 2021.
- Huong T. Le, Que X. Bui (2021). Keyphrase
Extraction Using PageRank and Word Features. RIVF 2021.
- Hai Cao Manh, Huong Le Thanh, Tuan Luu Minh (2019). Extractive
Multi-document Summarization using K-means, Centroid-based Method, MMR, and
Sentence Position. SoICT 2019.
- Viet Nguyen Quoc, Huong Le Thanh and Tuan Luu Minh (2019). Abstractive
Text Summarization using LSTMs with Rich Features. In: Nguyen LM., Phan
XH., Hasida K., Tojo S. (eds) Computational Linguistics. PACLING 2019. Communications
in Computer and Information Science, vol. 1215, pp. 28-40. Springer, Singapore.
https://doi.org/10.1007/978-981-15-6168-9_3
- Nguyen Thi Thu Trang, Le Thanh Huong, Duong Viet Hung (2017). Enhancing
extractive summarization using non-negative matrix factorization with semantic
aspects and sentence features. SoICT 2017, pp.78-83
- Huong T. Le, Son V. Nguyen, Lam N. Pham, Duy D. Nguyen and An N. Nguyen.
2016. Semantic Text Alignment based on Topic Modeling.
RIVF 2016. (Best runner-up paper award)
- Huong Le Thanh, Luan Tran Van, Hoai Nguyen Xuan, Hien Nguyen Thi. 2015.
Optimizing Genetic Algorithm in Feature Selection
for Named Entity Recognition. SoICT 2015.
- Phạm Minh Chuẩn, Lê Thanh Hương, Trần Đình Khang, Nguyễn Văn Hậu. 2015.
A recommender system using particle swarm optimization (in Vietnamese). National
Scientific Conference on Fundamental and Applied IT Research (FAIR'8)
- Huong Thanh Le, Rathany Chan Sam, Hoan Cong Nguyen, Thuy Thanh Nguyen. 2013.
Named Entity Recognition in Vietnamese Text
Using Label Propagation. In Procs. of the 5th International Conference
of Soft Computing and Pattern Recognition (SoCPaR 2013)
- Huong Thanh Le, Tien Manh Le. 2013. An
approach to Abstractive Text Summarization. In Procs. of the 5th International
Conference of Soft Computing and Pattern Recognition (SoCPaR 2013)
- Huong Thanh Le, Luan Van Tran. 2013. Automatic Feature
Selection for Named Entity Recognition Using Genetic Algorithm. In Proceedings
of the 4th Symposium on Information and Communication Technology (SoICT 2013).
- Đỗ Bá Lâm, Lê Thanh Hương. 2012. Building a database of the Vietnamese IT
research community (in Vietnamese). Proceedings of the 15th National Workshop
"Selected Problems of Information and Communication Technology".
- Chan Rathany Sam, Huong Thanh Le, Thanh Thuy Nguyen, Anh Dung Le and Thi
Minh Ngoc Nguyen. 2011. Semi-Supervised Learning for Relation Extraction in
Vietnamese Text. In Proceedings of the Symposium on Information and Communication
Technology (SoICT), Oct. 13-14, 2011, Hanoi, Vietnam
- Rathany Chan Sam, Huong Thanh Le, Thuy Thanh Nguyen, Thien Huu Nguyen. 2011.
Combining Proper Name-Coreference with Conditional Random Fields for Semi-supervised
Named Entity Recognition in Vietnamese Text. The 15th Pacific-Asia Conference
on Knowledge Discovery and Data Mining (PAKDD), pp. 512-525.
- Huong Thanh Le, Rathany Chan Sam and Phuc Trong Nguyen. 2010. Extracting
Phrases in Vietnamese Document for Summary Generation. The International Conference
on Asian Language Processing (IALP), Dec 28-30, 2010, Harbin, China.
- Rathany Chan Sam, Huong Thanh Le, Thuy Thanh Nguyen, The Minh Trinh. 2010.
Relation Extraction in Vietnamese Text using Conditional Random Fields. The
Sixth Asia Information Retrieval Societies Conference (AIRS), Dec.1-3, 2010,
Taipei, Taiwan.
- Huong Thanh Le, Lam Ba Do, Nhung Thi Pham. 2010. Efficient Syntactic Parsing
with Beam Search. The 2010 IEEE RIVF conference, Nov. 01-04, 2010, Hanoi,
Vietnam.
- Huong Thanh Le, Thien Huu Nguyen. 2010. Name Entity Recognition using Inductive
Logic Programming. In Proceedings
of the Symposium on Information and Communication Technology (SoICT),
Aug. 27-28, 2010, Hanoi, Vietnam
- Dung Dao T., Huong Le T. 2008. Applying Information Extraction To Automatic
Web Advertising. In Proceedings
of the International Conference on Asian Language Processing (IALP),
Nov. 12-14, 2008, Chiang Mai, Thailand
- Lam Do B., Huong Le T. 2008. Implementing A Vietnamese Syntactic Parser
Using HPSG. In Proceedings
of the International Conference on Asian Language Processing (IALP),
Nov. 12-14, 2008, Chiang Mai, Thailand.
- Anh Nguyen Kim, Huong
Thanh Le. 2008. Natural Language Interface Construction using Semantic Grammars.
The 10th Pacific Rim International Conference on Artificial Intelligence (PRICAI),
Hanoi, Vietnam.
- Nguyen Trong Phuc, Le
Thanh Huong. 2008. Vietnamese text summarisation using discourse structures.
The ICT.rda conference, Hanoi, Vietnam.
- Do Ba Lam, Le Thanh Huong.
2008. Implementing a Vietnamese syntactic parser using HPSG. In Proceedings
of the ICT.rda conference, Hanoi, Vietnam.
- Do Ba Lam, Le Thanh Huong.
2008. Improvement of Earley parsing for Vietnamese language. The ICT.rda conference,
Hanoi, Vietnam.
- Le Thanh Huong. 2007.
A study on Vietnamese Syntactic Parsing. In Proceedings of the 4th International
AIST-VAST Scientific Workshop, Hanoi, Vietnam.
- Huong LeThanh. 2007. A frame-based approach to Text Generation. In Proceedings
of the PACLIC21 conference, Seoul, Korea, Nov. 1-3, 2007
- Nguyen Quoc The, Le Thanh
Huong. 2007. Vietnamese syntactic parsing using the Lexicalized Probabilistic
Context-free Grammar. In Proceedings of the FAIR conference, Nha Trang,
Vietnam, Aug. 9-10, 2007.
- Le Thanh Huong. 2007.
A study on generating natural language answers from query's result tables.
FAIR conference, Nha Trang, Vietnam, Aug. 9-10, 2007.
- Huong LeThanh and Geetha
Abeysinghe. 2004. Using Syntactic Structures and Cohesive Devices in Recognizing
Discourse Structure of Text. In Proceedings of the Joint Workshop of Vietnamese
Society of AI, SIGKBS JSAI, ICS IPSJ and IEICE SIGAI on Active Mining, Hanoi,
Vietnam, Dec. 4-7, 2004.
- Huong LeThanh, Geetha
Abeysinghe, and Christian Huyck. 2004. Generating Discourse Structures for Written Texts.
In Proceedings of the International Conference on Computational Linguistics
(COLING 2004), Geneva, Switzerland.
- Huong LeThanh, Geetha
Abeysinghe, and Christian Huyck. 2004. Automated Discourse Segmentation by Syntactic Information
and Cue Phrases. In Proceedings of the IASTED International Conference
on Artificial Intelligence and Applications (AIA 2004), Innsbruck, Austria.
- Huong T. Le and Geetha
Abeysinghe. 2003. A Study to Improve the Efficiency
of a Discourse Parsing System. In Proceedings of the 4th International
Conference on Intelligent Text Processing and Computational Linguistics (CICLing'03),
Mexico. Lecture Notes in Computer Science (LNCS) No. 2588, A. Gelbukh, Ed.,
Springer-Verlag.
- Huong LeThanh, Geetha
Abeysinghe, and Christian Huyck. 2003. Using Cohesive Devices to Recognize Rhetorical Relations
in Text. In Proceedings of 4th Computational Linguistics UK Research Colloquium
(CLUK 4), University of Edinburgh, UK.
- Nguyen Thanh Thuy, Nguyen
Quoc Dinh, Nguyen Quoc Khanh, Nguyen Huu Duc, Le Thanh Huong, Phan Dinh Hieu,
Nguyen Quang Huy, Dinh Lan Anh. Building a virtual parallel computing system
to apply in image processing. In Proceedings of Conference of Selective Problems
in Information Technology, Hue city, Vietnam, 1999.
Theses
- Automatic Discourse Structure
Generation Using Rhetorical Structure Theory. 2004. Ph.D. dissertation, Middlesex
University, U.K.
- Robotics - Unsupervised
Learning for Controlling a Robot-hand. 2001. Master's thesis, Free University
of Brussels (VUB), Belgium.
- Vietnamese syntactic parser.
1999. Master's thesis, Hanoi University of Technology.
- Applying Groupware Technology
to developing distributed database systems. 1997. Undergraduate thesis, Hanoi
University of Technology.
Teaching
Master courses
Undergraduate courses