Skip Navigation Links

Project Information

CAREER: RELAXED CONTENT AND STRUCTURE QUERIES OVER HETEROGENEOUS DATA

Agency:
NSF

National Science Foundation

Project Number:
0844935
Contact PI / Project Leader:
MARIAN, AMELIE
Awardee Organization:
RUTGERS THE ST UNIV OF NJ NEW BRUNSWICK

Description

Abstract Text:
This award is funded under the American Recovery and Reinvestment Act of 2009 (Public Law 111-5).

The traditional separation between the "structure-only" Database world and the "text-only" Information Retrieval world is fading. Databases now routinely include text components while documents are being augmented with structural information. The goal of this project is to design novel techniques and develop tools to efficiently query and retrieve relevant information in a heterogeneous data environment where flexibility in conditions on both the content, the structure of the data, and the response to a query is desirable. The first main contribution of the project is the design of quality scoring mechanisms that unify content and structure score in an integrated fashion. The scoring techniques take into account the similarity between the query and the answer to assign scores. The second main contribution of the project is the development of heterogeneous data index structures and query processing algorithms to efficiently identify exact and approximate query answers and provide the answers in the order of relevance to a query. The work resulting from this project will be evaluated through an in-depth study of the impact of the scoring strategies on answer quality and performance experiments on the query processing techniques.

The results of this project are expected to enable users to identify the data that best fits their needs, in a variety of heterogeneous data environments, without requiring some preexisting knowledge of the underlying data schema or content. This project integrates research and education through curriculum development, student advising, and outreach to women in Computer Science. Results of this project, including publications, data sets and software will be made available on the project website (http://www.cs.rutgers.edu/~amelie/RelaxedQ/).
Project Terms:
Accounting; Algorithms; American; Award; computer science; Computer software; Data; Data Set; Databases; design; Development; Education; Educational Curriculum; Environment; flexibility; Funding; Goals; indexing; Information Retrieval; Knowledge; Laws; novel; outreach; Performance; Process; Publications; Recovery; Research; research study; response; Structure; Students; Techniques; Text; tool; web site; Woman; Work

Details

Contact PI / Project Leader Information:
Name:  MARIAN, AMELIE
Other PI Information:
Not Applicable
Awardee Organization:
Name:  RUTGERS THE ST UNIV OF NJ NEW BRUNSWICK
City:  NEW BRUNSWICK    
Country:  UNITED STATES
Congressional District:
State Code:  NJ
District:  06
Other Information:
Fiscal Year: 2009
Award Notice Date: 13-Jul-2009
DUNS Number: 001912864
Project Start Date: 15-Jul-2009
Budget Start Date:
CFDA Code: 47.082
Project End Date: 30-Jun-2014
Budget End Date:
Agency: ?

Agency: The entity responsible for the administering of a research grant, project, or contract. This may represent a federal department, agency, or sub-agency (institute or center). Details on agencies in Federal RePORTER can be found in the FAQ page.

National Science Foundation
Project Funding Information for 2009:
Year Agency

Agency: The entity responsible for the administering of a research grant, project, or contract. This may represent a federal department, agency, or sub-agency (institute or center). Details on agencies in Federal RePORTER can be found in the FAQ page.

FY Total Cost
2009 NSF

National Science Foundation

$499,759

Results

i

It is important to recognize, and consider in any interpretation of Federal RePORTER data, that the publication and patent information cannot be associated with any particular year of a research project. The lag between research being conducted and the availability of its results in a publication or patent award varies substantially. For that reason, it's difficult, if not impossible, to associate a publication or patent with any specific year of the project. Likewise, it is not possible to associate a publication or patent with any particular supplement to a research project or a particular subproject of a multi-project grant.

ABOUT FEDERAL REPORTER RESULTS

Publications: i

Click on the column header to sort the results

PubMed = PubMed PubMed Central = PubMed Central Google Scholar = Google Scholar

Patents: i

Click on the column header to sort the results

Similar Projects

Download Adobe Acrobat Reader:Adobe Acrobat VERSION: 3.41.0 Release Notes
Back to Top