IMPLEMENTASI CONTENT-BASED RETRIEVAL PADA PERPUSTAKAAN DIGITAL BERBASIS OPEN SOURCE MENGGUNAKAN APACHE LUCENE

David David

Abstract


Abstract : Applications are built can be used to classification search result documents and documents search easier. Documents that are only for the article from journal, thesis, ebook and other documents. Indexing and searching documents using Lucene as a search engine. An filing text document often needed for finding document that content special word or combination of some words. In this research, is made application that can save and retrieve text document, using java program language, db4O and Lucene Library. For efficient in saving, data stop word eliminated, and take account the existent of synonim words. In retrieving document it is possible using operator AND, OR and NOT with the number of words priority that exist in that document. The process of searching are divided into two, namely Simple Search and Advanced Search. Simple Search using a query to search Db4o while Advanced Search using the search terms in the index using Lucene library. In this system the test results obtained are accurate.
Keywords : Apache Lucene, Indexing, TF-IDF


Keywords


Apache Lucene; Indexing; TF-IDF;

Full Text:

PDF

References


Db4o-5.2 Tutorial, db4objects Inc., USA

Irwanto, Djon., 2007, Membangun Object Oriented Software dengan Java dan Object

Database, PT Elex Media Komputindo, Jakarta

DRTC-HP

International Workshop on Building Digital Libraries using DSpace, 7th 11th

March, 2005, DRTC, Bangalore.

Seki, Y., 2003, "Sentence Extraction by tf/idf and Position Weighting from Newspaper Articles", Proceeding of the Third NTCIR Workshop, National Institute of

Informatics

Subrata, Gatot., 2009, Perpustakaan Digital,

http://library.um.ac.id/images/stories/pustakawan/kargto/Perpustakaan%20Digit

al.pdf, diakses pada tanggal 24 Maret 2009

-IDF approach for text

Journal of Zhejiang University SCIENCE, 2005 6A(1):49-55,

ISSN 1009-3095.

The Apache Software Foundation, 2 Apache Lucene Overview

http://lucene.apache.org/java/docs/, diakses pada tanggal 19 Februari 2009.

http://www.scribd.com/doc/3020850/Perpustakaan-Digital-dan-Sistem-OtomasiPerpustakaan, diakses tanggal 24 Maret 2009.




DOI: http://dx.doi.org/10.30700/jst.v1i2.9

Article Metrics

Abstract view : 159 times
PDF - 252 times

Refbacks

  • There are currently no refbacks.


Badan Pengelola Jurnal Ilmiah Sistem Informasi dan Teknik Informatika (SISFOTENIKA) STMIK Pontianak.

 

Jurnal Ilmiah SISFOTENIKA terindex di :


   

   

  

    

    

    

   

 

 

 

ISSN Printed : 2087-7897

ISSN Online : 2460-5344


SERTIFIKAT PENGHARGAAN :

Jurnal Ilmiah SISFOTENIKA Terakreditasi Peringkat Empat

 

Partners & Co-Organizers:




Lisensi Creative Commons

Jurnal Ilmiah SISFOTENIKA: STMIK Pontianak Online Journal ISSN Printed (2087-7897) - ISSN Online (2460-5344) licensed under a Lisensi Creative Commons Atribusi 4.0 Internasional. Flag Counter

View My Stats>