Designing a Chat-bot for College Information using Information Retrieval and Automatic Text Summarization Techniques

Radha Guha

doi:10.2174/2665997201999201022191540

ISSN: 2665-9972
E-ISSN: 2665-9964

Designing a Chat-bot for College Information using Information Retrieval and Automatic Text Summarization Techniques
By Radha Guha¹
View Affiliations Hide Affiliations

¹ CSE Department, SRM University AP, Andhra Pradesh, India
Source: Current Chinese Computer Science, Volume 1, Issue 1, Apr 2021, p. 42 - 51
DOI: https://doi.org/10.2174/2665997201999201022191540
- Received: 05 Jun 2020
- Accepted: 21 Sep 2020
- Available online: 01 Apr 2021

Abstract

Background: In the era of information overload it is very difficult for a human reader to make sense of the vast information available on the internet quickly. Even for a specific domain like a college or university website, it may be difficult for a user to browse through all the links to quickly get the relevant answers.

Objective: In this scenario, the design of a chat-bot which can answer questions related to college information and compare between colleges will be very useful and novel.

Methods: In this paper, a novel conversational interface chat-bot application with information retrieval and text summarization skill is designed and implemented. Firstly, this chat-bot has a simple dialog skill; when it can understand the user query intent, it responds from the stored collection of answers. Secondly, for unknown queries, this chat-bot can search the internet, and then perform text summarization using advanced techniques of natural language processing (NLP) and text mining (TM).

Results: The advancement of NLP capability of information retrieval and text summarization using machine learning techniques of Latent Semantic Analysis (LSI), Latent Dirichlet Allocation (LDA), Word2Vec, Global Vector (GloVe) and TextRank is reviewed and compared in this paper first before implementing them for the chat-bot design. This chat-bot improves user experience tremendously by getting answers to specific queries concisely which takes less time than to read the entire document. Students, parents and faculty can get the answers for a variety of information like admission criteria, fees, course offerings, notice board, attendance, grades, placements, faculty profile, research papers, patents, etc. more efficiently.

Conclusion: The purpose of this paper was to follow the advancement in NLP technologies and implement them in a novel application.

Article metrics loading...

/content/journals/cccs/10.2174/2665997201999201022191540

2021-04-01

2025-07-03

From This Site

/content/journals/cccs/10.2174/2665997201999201022191540

dcterms_title,dcterms_subject,pub_keyword

-contentType:Contributor -contentType:Concept -contentType:Institution

10

5

Full text loading...

References

FeldmanR. SangerJ. The text mining handbook: advanced approaches in analyzing unstructured data.New York, NYCambridge University Press2007
[Google Scholar]
HighR. The era of cognitive systems: an inside look at ibm watson and how it works.IBM Corporation2012
[Google Scholar]
ElderJ. MinerG. NisbetR. Practical text mining and statistical analysis for non-structured text data applications.Elsevier2012
[Google Scholar]
BerryM.W. Survey of text mining: clustering, classification and retrieval.Springer2007
[Google Scholar]
CollobertR. WestonJ. a unified architecture for natural language processing: deep neural networks with multitask learningproceedings of the 25th international conference on machine learning.2008, 160167Helsinki, Finland
[Google Scholar]
BengioY. DucharmeR. VincentP. JauvinC. A neural probabilistic language model.Journal of MLR.2003311371155
[Google Scholar]
TuringA.M. Computing Machinery and Intelligence.Mind195043346010.1093/mind/LIX.236.433
[Google Scholar]
BradeskoL. MladenicD. A Survey of chatbot systems through a loebner prize competitionproceedings of slovenian language technologies society eighth conference of language technologies2012, 3437
[Google Scholar]
DeerwesterS. DumaisS. FurnasG. LandauerT. HarshmanR. Indexing by latent semantic analysisJ. of JASIS199010.1002/(SICI)1097‑4571(199009)41:6<391::AID‑ASI1>3.0.CO;2‑9
[Google Scholar]
BleiD. NgA. JordanM. Latent Dirichlet Allocation.J. MLR200339931022
[Google Scholar]
BleiD. Griffiths, Jordan M., and Tannenbaum J., Hierarchical topic models and the nested chinese restaurant process. Advances in Neural Information Processing Syst.Cambridge, MAMIT Press2004
[Google Scholar]
BrettM. Topic modeling: a basic introduction.J. JDH2012
[Google Scholar]
GriffithsT.L. SteyversM. Finding scientific topicsProc. Natl. Acad. Sci. USA2004101Suppl. 15228523510.1073/pnas.030775210114872004
[Google Scholar]
RadhaG. Exploring the field of text miningJ. IJCA2017Vol. 975
[Google Scholar]
RadhaG. Exploring information retrieval by latent semantic and latent dirichlet allocation techniques.J. IRJCS2020
[Google Scholar]
RadhaG. Impact of artificial intelligence and natural language processing on programming and software engineering.J. IRJCS2020
[Google Scholar]
LiY. DavidM. ZuhairB. JamesD. O’SheaD. KeelC. Sentence similarity based on semantic nets and corpus statistics.IEEE Trans. Knowl. Data Eng.2006181138115010.1109/TKDE.2006.130
[Google Scholar]
MikolovT. SutskeverI. ChenK. CorradoG. DeanJ. Distributed representations of words and phrases and their compositionality.Adv. Neural Inf. Process. Syst.201331113119
[Google Scholar]
MikolovT. QuocV. SutskeverI. Exploiting similarities among languages for machine translationarXiv:1309 4168 [CS CL]2013
[Google Scholar]
PenningtonJ. SocherR. C.Manning Glove: global vector for word representationProceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)201415324310.3115/v1/D14‑1162
[Google Scholar]
RumelhartD.E. GeoffreyE.H. WilliamsR.J. Learning representations by back- propagating errors.Nature1986323608853353610.1038/323533a0
[Google Scholar]
ZhuT. KanL. The similarity measure based on lda for automatic summarization. Elsevier.. IWIEE2012
[Google Scholar]
LinC.Y. ROUGE: A Package for automatic evaluation of summaries. proceedings of the workshop on text summarization branches out.Barcelona, Spain2004
[Google Scholar]
LuhnH.P. The automatic creation of literature abstracts.IBM J. Res. Develop.19582215916510.1147/rd.22.0159
[Google Scholar]
DunningT. Accurate methods for the statistics of surprise and coincidence.Comput. Linguist.1993916174
[Google Scholar]
EdmundsonH.P. New methods in automatic extracting.J. Assoc. Comput. Mach.196910.1145/321510.321519
[Google Scholar]
MihalceaR. Text rank - bringing order into textsProceedings of the conference on empirical methods in natural language processing (EMNLP 2004)2004
[Google Scholar]
LinC.Y. ROUGE: A package for automatic evaluation of summaries. proceedings of the workshop on text summari-zation branches out.Barcelona, Spain2004
[Google Scholar]

/content/journals/cccs/10.2174/2665997201999201022191540

Designing a Chat-bot for College Information using Information Retrieval and Automatic Text Summarization Techniques

Current Chinese Computer Science 1, 42 (2021); https://doi.org/10.2174/2665997201999201022191540

/content/journals/cccs/10.2174/2665997201999201022191540

Data & Media loading...

Article Type: Research Article

Keyword(s): Chat-bot; GloVe; information retrieval; latent dirichlet allocation; latent semantic analysis; natural language processing; text mining; text summarization; textrank; topic modeling; word embedding; word2vec

Designing a Chat-bot for College Information using Information Retrieval and Automatic Text Summarization Techniques

Abstract

From This Site

Most Read This Month

Most Cited Most Cited RSS feed

Weighted Aggregation Operators of Fuzzy Credibility Cubic Numbers and their Decision Making Strategy for Slope Design Schemes

The Performance Analysis and Monitoring of Grid-connected Photovoltaic Power Plant

Leaf Image Classification with the Aid of Transfer Learning: A Deep Learning Approach

Correlation Coefficients of Linguistic Neutrosophic Sets and their Multicriteria Group Decision Making Strategy for Medical Treatment Options

A Comprehensive Survey on Current Literature, Standards, ‎Applications and Projects of Self-Organizing Aerial Ad Hoc Network ‎‎(AANET) in Smart Cities

An Ensemble of Community Detection in Social Networks Using Clustering of Users Demographic and Topological Information