Skip Navigation Links

Project Information

STUDENT TRAVEL SUPPORT FOR 2017 WEB ARCHIVING DATA WORKSHOPS

Agency:
NSF

National Science Foundation

Project Number:
1723430
Contact PI / Project Leader:
WEBER, MATTHEW S
Awardee Organization:
RUTGERS THE ST UNIV OF NJ NEW BRUNSWICK

Description

Abstract Text:
Funding will provide graduate student support for two workshops Web Archiving Data Workshops in 2017. The workshops focus on analyzing archived Web data. Such data sources provide unique opportunities to examine social science phenomena that have evolved over time, such as how news media is presented on the Web or how individuals interact in online discussions. On the other hand, such data are often difficult to access and challenging to analyze due to the fact that the data were often captured using sporadic or random approaches, due to data formats that are opaque and not always standardized, and due to the large size of the data (mid-sized collections can be on the terabyte scale). The workshops will educate graduate students with regards to combinations of code and research questions that can be used to design appropriate research studies and to then access and analyze this type of data. The focus of the workshops is interdisciplinary, bringing together research interests from disciplines as diverse as computer science, social science and the humanities Participants will work hands-on with data, will learn new programming skills, and will develop pilot projects for new research that they will be able to continue at their home institutions. For example, workshop participants will learn about the Warcbase software package (based on Spark and Scala), and will also have the opportunity to analyze data using in-browser Python notebooks. The first workshop will take place in February 2017 at the Internet Archive in San Francisco, CA, and the second workshop will take place in June 2017 at the British Library in London, UK. The first workshop will focus on the use of application programming interfaces for interoperability between collections and institutions. The second workshop will focus on establishing international collaborations between North American and European researchers, and will include a discussion of how to utilize archived Web data with other types of data such as socioeconomic variables. Data used and projects developed during the workshops will be published to the workshop website, and will also be made available via a GitHub repository. A final report will be produced summarizing the results of the workshop, and reviewing current state of the art research pertaining to archived Web data. This will be a significant contribution to the field as it is beginning to coalesce.
Project Terms:
American; application programming interface; Archives; base; British; Code; Collaborations; Collection; computer science; Computer software; Data; data archive; data format; Data Sources; design; Discipline; Educational workshop; European; Funding; graduate student; Home environment; Humanities; Individual; Institution; interest; International; Internet; interoperability; Learning; Libraries; London; Names; news; Participant; Pilot Projects; Publishing; Pythons; Reporting; repository; Research; Research Personnel; research study; San Francisco; skills; Social Sciences; socioeconomics; Standardization; Students; terabyte; Time; Travel; web site; Work

Details

Contact PI / Project Leader Information:
Name:  WEBER, MATTHEW S
Other PI Information:
Not Applicable
Awardee Organization:
Name:  RUTGERS THE ST UNIV OF NJ NEW BRUNSWICK
City:  PISCATAWAY    
Country:  UNITED STATES
Congressional District:
State Code:  NJ
District:  06
Other Information:
Fiscal Year: 2017
Award Notice Date: 21-Feb-2017
DUNS Number: 001912864
Project Start Date: 15-Feb-2017
Budget Start Date:
CFDA Code: 47.070
Project End Date: 31-Jan-2018
Budget End Date:
Agency: ?

Agency: The entity responsible for the administering of a research grant, project, or contract. This may represent a federal department, agency, or sub-agency (institute or center). Details on agencies in Federal RePORTER can be found in the FAQ page.

National Science Foundation
Project Funding Information for 2017:
Year Agency

Agency: The entity responsible for the administering of a research grant, project, or contract. This may represent a federal department, agency, or sub-agency (institute or center). Details on agencies in Federal RePORTER can be found in the FAQ page.

FY Total Cost
2017 NSF

National Science Foundation

$24,113

Results

i

It is important to recognize, and consider in any interpretation of Federal RePORTER data, that the publication and patent information cannot be associated with any particular year of a research project. The lag between research being conducted and the availability of its results in a publication or patent award varies substantially. For that reason, it's difficult, if not impossible, to associate a publication or patent with any specific year of the project. Likewise, it is not possible to associate a publication or patent with any particular supplement to a research project or a particular subproject of a multi-project grant.

ABOUT FEDERAL REPORTER RESULTS

Publications: i

Click on the column header to sort the results

PubMed = PubMed PubMed Central = PubMed Central Google Scholar = Google Scholar

Patents: i

Click on the column header to sort the results

Similar Projects

Download Adobe Acrobat Reader:Adobe Acrobat VERSION: 3.41.0 Release Notes
Back to Top