ABSTRACT
The amount of information available today is extremely large. The increasing need for easier and faster information discovery demands optimal information retrieval techniques. A measure of performance of any information retrieval system is based on the effectiveness and efficiency of retrieval. While some techniques rely on algorithms that improve search, others aim at increasing user’s ability to formulate search queries. Here we present a non‐ predefined query model for information retrieval system based on a relational database.
TABLE OF CONTENTS
Abstract
Table of contents
List of figures
List of tables
CHAPTER ONE
1. Introduction
1.1 Research context
1.2 Problem statement
1.3 Research objectives
1.4 Research methodology
1.5 Organization of work
CHAPTER TWO
2. Literature review
2.1 Information Retrieval
2.2 Data or information retrieval?
2.3 IR Models
2.4 Boolean Retrieval
2.5 Data warehousing
2.6 Human-Computer Interface
2.6.1 User interfaces for search by Marti A. Hearst
2.6.2 Designing the User Interface by Ben Shneiderman
2.6.3 Models of Interaction
2.6.4 Design of Search Interfaces
2.7 Related Works
CHAPTER THREE
3. Research proposal
3.1 Information Retrieval System
3.2 Non-predefined Query Model
3.3 Interface for IR
3.4 Design Specifications
CHAPTER FOUR
4. Case study
4.1 Employee Records
4.2 Data Modeling
4.2.1 Entity-Relation (ER) Model
4.2.2 Graph of Relations
4.2.3 Dictionary of Attributes
4.2.4 Dictionary for Query
4.3 Cross Tabulation
CHAPTER FIVE
5. Conclusion
5.1 Contributions
5.2 Perspectives
References
CHAPTER ONE
INTRODUCTION
Information retrieval is obtaining information by searching a repository for items that match user’s information need. According to Losee (1998), retrieval systems often order documents in a manner consistent with the assumptions of Boolean logic, by retrieving, for example, documents that have the terms dogs and cats, and by not retrieving documents without one of these terms. Systems consistent with the probabilistic model of retrieval locate documents based on a query list of terms, such as {dogs, cats}, or may accept as input a natural language query, such as I want information on dogs and cats. A probabilistic system then ranks documents for retrieval by assigning a numeric value to each document, based on the weights for query terms and the frequencies of term occurrences in documents.
We want to know how to “best” formulate a query, and our ultimate interest in measures of human utility: how satisfied is each user with the results the system gives for each information need that they pose? Manning et al (2009). Most everyday users of IR systems expect IR systems to do ranked retrieval, unfortunately relevance ranking is often not critical in Boolean systems. On the other hand, most IR systems rank documents by their estimation of the usefulness of a document for a user query, and there is little or nothing a user can do about it. However, many power users still use Boolean systems as they feel more in control of the retrieval process.
It is correct that the set of retrieved documents are not ranked in Boolean searches. However, the cost of a ranked set is a set that is not fully controlled or understood by the user. In Boolean searches, the user obtains well‐defined search sets, which is a clear advantage if searching is considered a learning process. The well‐defined set provides better feedback and therefore allows modified search profiles. Hjørland (2014).
Conventional IR systems are built on the Boolean model while most IR systems rely on sophisticated algorithms for better ranking. Most of the materials for ranking are usually documents of an unstructured nature (usually text). Today, research in the field IR are split among various activities and between (a) optimizing algorithms for ranked systems and (b) extending the Boolean model to increase the selection power of users.
Researchers who are working on the storage side of the information retrieval system are engaged in designing sophisticated methods for identification and representation of the various bibliographic elements essential for documents, automatic content analysis, text processing and so on. On the other hand, researchers working on the retrieval side are attempting to develop sophisticated searching techniques, user interfaces, and various techniques for producing output for local as well as remote users. Chowdhury (2004).
1.1 Research context
The area of study is within the domain of Information Retrieval. IR is a broad area and an important aspect of life. The recognition of the important role information plays in our daily lives has led to an outburst of studies aimed at advancing the field of IR....
================================================================
Item Type: Project Material | Size: 48 pages | Chapters: 1-5
Format: MS Word | Delivery: Within 30Mins.
================================================================
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.