Presentation
Transcription
Presentation
Geographic Information Systems Stephen A. Matthews & Anne Vernez Moudon Discussion Group Members Tom Baranowski Mark Daniel Ann Forsyth Kipling Gallion Robin McKinnon Joseph Sharkey David Stinchcomb Celeste Torio “Context” An overall goal should be … To think creatively about how GIS and related technologies can be used in food, physical activity, and built environment research, and specifically to stretch the technology and revise the methodologies we currently use. There are opportunities and challenges. GIS Introduction/Background GIS technologies/tools/data have existed since the 1960’s, and some of the spatial methods and data issues have been known about since the early half of the 20th century. We are in a much better ‘place’ doing GIS research in 2007 than we were in 1987. GIS remains a rapidly evolving field drawing on new technologies for data collection, integration, sharing, and analysis. Importance of GIS - “Value Added” 1. New spatial data products and data collection technologies (e.g., wearable GPS & other wireless sensors, high-resolution air photos, etc.) 2. The integration of otherwise disparate data sets (socioeconomic, business, crime, landuse, terrain, road networks, traffic, health service, climate, etc.) 3. The ability to redefine conventional geographic units of analysis (e.g. based on administrative neighborhoods) and create new flexible geographies (e.g., egocentric neighborhoods) to define context. Importance of GIS - “Value Added” 4. Incorporating geographic relationships and structure in to our models (e.g. through the ability to handle proximity, distance, adjacency, and contiguity measures). 5. Flexibility in how scale can be incorporated in research. Methodologically, scale affects how we can measure and/or represent food/built environments. 6. Use and integration of exploratory spatial data analysis techniques within projects The Key Challenges / Issues • Conceptual Models • Conceptualizing Place (Neighborhood) “Beyond the Census Tract” • Tracking spatial behavior – Where do people go? • Data quality (standards) - validation issues • Privacy/confidentiality issues • Training related issues #1: Conceptual Models • Why built environment -> Obesity? • What causal pathways? Howard Frumkin (AJPM 2006) asked some key questions: • What theoretical model should underlie our definition & measurement of place? • What characteristics of place should we measure? • By what mechanism(s) does place affect outcomes of interest? (How does the environment “get into people?”) • How is human behavior and activity influenced by contextual factors? We should not be data driven? #2: Conceptualizing Place “Probably no other term is used so loosely or with such changing content as the term neighborhood, and very few concepts are more difficult to define.” “the concept of neighborhood has come down from a distant past and therefore has connotations which scarcely fit the facts when applied to a patch of life in a modern large city” Roderick McKenzie (1921) American Journal of Sociology pp. 344-345, and p. 346 Naïve Assumptions In studies of the relationship between place and health outcomes we tend to assume place = neighborhood = census unit (tract) and the primary census tract attribute of interest in the study of health outcomes = SES. But also … • Places are bounded • Places are isolated islands removed from larger hierarchical and non-hierarchical contexts • Places are typically defined at one ‘scale’ What are the boundaries? How far does a place extend? How do we capture the dynamics of place? What spatial scale is most salient with respect to outcomes? #2: Conceptualizing Place Challenging Conventional Wisdom? Areal units are particularly sacred once they have been established even though they later may become serious obstacles to the solutions of contemporary problems Ron Abler, John Adams, & Peter Gould (1971) Spatial Organization What size is a neighborhood? Despite Census Bureau portrayals of tracts as compact, stable, neighborhood-like units and their reification* as such by social scientists, population rather than territory represents the decisive criterion in defining tracts. * Investigators justify using the tract in terms of how closely it approximates a ‘real’ neighborhood. But what do we know about tracts? Substantial variation exists in the territorial size of census tracts within and across metro areas. Tract size for Riverside-San Bernardino-Ontario Metropolitan area Median: Mean: Minimum: 4.9 km2 (1.9 mi2) 121.3 km2 (46.8 mi2) 0.5 km2 (.2 mi2) Maximum: 20,700.4 km2 (7,992.5 mi2) Calculations prepared for Lee, B.A. et al (manuscript under review) “Beyond the Census Tract” Average interquartile range in tract size 2 25Intra-metropolitan highest-density metro areas: 1.1 to 6.2 km variation also occurs in tract population 25 lowest-density metro areas: 2.5 to 27.8 km2 Average min/max tract size for 100 largest metro areas 45 of the 100 largest areas have tracts falling below and Minimum: .4 km2metropolitan (.16 mi2) above the prescribed 1,500-12,000 range. 2 Maximum: 1,319.0 km (509.3 mi2) Median tract size is less than 2.6 km2 ( 1 square mile) in 22 of the top 100 metro areas but over 7.8 km2 (3 square miles) in 13 others. 23 metros have tracts of less than a square mile AND of more than 500 square miles. Calculations prepared for Lee, B.A. et al (manuscript under review) “Beyond the Census Tract” #3: Where do people go? It is clear that we do not do a good job tracking spatial behavior and this limits our understanding of where people go (to shop, to exercise). There is an established literature on the spatial patterns or spatial dimensions of daily life. Studies of “Time” geography date back to the 1960s and 1970s emerging from the Lund School in Sweden and the pioneering work of Hagerstrand). Interest in time-space movement has been revitalized in GIS environments based on the use of ‘geocodable’ activity diaries using GPS and wireless technologies. #3: Where do people go? Space-time aquarium with the space-time paths of African Americans, Hispanics and Asian Americans in the subsample. Kwan 2000, p.197. Kwan, M-P (2000) “Interactive geovisualization of activity travel patterns using 3-D GIS” Transportation Research Part C 8, 185-203. #3: Where do people go? A close-up view of downtown Portland Kwan 2000, p. 198. Kwan, M-P (2000) “Interactive geovisualization of activity travel patterns using 3-D GIS” Transportation Research Part C 8, 185-203. Multiple Activity Spaces Maps based on geoethnography as applied to the Three City Project (Matthews, various presentations and Matthews et al, 2005) Jane Jacobs (1961) The Death and Life of Great American Cities claims that three levels of neighborhood exist: the block, the community or district (of approx. 100,000), and the city as a whole Ethnographic data Boston families (N=43, ) 998 locations (removing all locations 20+ miles from respondent home) Matthews (manuscript in prep.) The salience of neighborhoods Academic Amnesia? (H. Gans) “Any city dweller can test for himself the meaning of his place of local residence. If he will list his major activities and then spot their focal centers on a map he will quickly discover that his associations and his associates are rarely to be found in the immediate vicinity of his home. Nor will he ordinarily find the home of his best friend in his neighborhood.” -- Barbara McClenahan Sociology & Social Research (1946, p. 272-3) Map : Matthews (manuscript in prep.) The salience of neighborhoods (John Everitt (1976) cites similar comments by McClenahan on use of non-local communities in 1926) #4: GIS Data Quality/Standards Without accurate, comprehensive, and timely data to work with researchers fall quickly (maybe unknowingly) into the “garbage in, garbage out” (GIGO) trap No form of presentation or analysis (descriptive or sophisticated modeling) can hide the limitations of the data. Some of the ‘limitations’ may be unknown due to a lack of metadata and/or ‘standards.’ #4: GIS Data Quality/Standards Metadata: The good news is that the US has a Federal Geographic Data Committee (FGDC) and back in 1993 they outlined a concept for a National Spatial Data Infrastructure. In part this need arises because digital data are often incomplete and/or incompatible, but the user may not know this because many datasets are poorly documented. The lack of metadata inhibits the ability to find and use data, and hinders data sharing between users. Issues can be especially relevant for “spatial” databases include: accuracy/precision boundary issues coverage timeliness/timeframe alignment scale / resolution #4: GIS Data Quality/Standards There are many groups involved in GIS standards. A key organization is the Open Geospatial Consortium (OGC) which focuses on interoperability standards but much of the work of the Federal government is unfinished. Some of these issues of standards and maintaining metadata on GIS data overlaps with training (for GIS technicians and changing the culture regarding documentation). #4: GIS Data Quality/Standards National Database of Community Indicators Although researchers such as Coulton (1997) have called for the creation of national databases that contain community-level indicators this is still not feasible as data sources and availability differ at all geographic scales. In single or small number of site studies it is easier to construct detailed databases. Sample reports – need to check to see if updates exist Neighborhood Indicators: Taking Advantage of the New Potential by G. Thomas Kingsley. Working Paper. Chicago, IL: American Planning Association, October 1998. Catalog of Administrative Data Sources Claudia J. Coulton with Lisa Nelson and Peter Tatian, in Mapping Your Community: Using Geographic Information to Strengthen Community Initiatives, by G. Thomas Kingsley, Claudia J. Coulton, Michael Barndt, David S. Sawicki, and Peter Tatian. Washington, D.C.: U.S. Department of Housing and Urban Development, October 1997. #5: Privacy/ Confidentiality Issues • We need to understand when privacy is an issue. • Understand how data are handled in a GIS and what we can and cannot map/publish. • Understand the implications for data sharing. • Need for journal reporting standards. Marc Armstrong (2002) “Geographic Information Technologies and their potentially Erosion Effects on Personal Privacy” Studies in the Social Sciences, 27 (1): 19-28. Locations of 30 randomly selected addresses in Iowa City 19 = Exact inverse address matched (63%) 63% 6 = Within one address (20%) 83% 4 = On the correct street segment (14%) 97% “This level of local accuracy means that there is significant risk that individual-level dot mapped information can be compromised to reveal addresses, and by implication, personal identities.” (p. 23) #6: GIS Training We recognized a need for GIS training researchers (project managers) and for GIS technicians - Culture of Documentation - Skill needs in research methods, basic statistics, computer science/basic database skills, visualization. We saw a need to building capacity in GIS at institutions independent of specific projects. Trained experts rather than learning as you go. #6: GIS Training University Sector Despite the facts that there are approximately 70 U.S. academic institutions that are fee-paying members of the University Consortium for Geographic Information Science (UCGIS), that the number of GIS-related courses at the undergraduate and graduate level is growing, that the number of on-line GIS certificate and Masters programs has grown, and that model GIS curricula have been developed (by UCGIS in 2006), the actual number of formal training programs offering courses on GIS and spatial analysis that are tailored towards health and social science applications are very few. #6: GIS Training Vendors The leading GIS/spatial analysis companies do offer regular workshops and even online resources based on their products (see for example ESRI (ArcGIS vendor) http://training.esri.com). These vendor courses vary widely in content, focus primarily on the product rather than necessarily the application, cover applications that are frequently tailored to the commercial use of the software, and also of importance to our target audience are not priced for the education market and can retail at around $500/day. Moreover, vendor training opportunities are rarely targeted towards health and social science research questions and applications. GIS Breakout Group Recommendations 1. Not GIS per se but we need more conceptual development and theoretical models on the mechanisms linking people and places. 2. Social and health science research is already seeing the emergence of studies where data on individuals embedded in multiple (and non-nested) geographically defined contexts are being collected and integrated. GIS and related technologies will play a key role in managing, visualizing and analyzing these kinds of data. We need to think more critically about definitions of neighborhood and measures of the characteristics of neighborhoods. We can use GIS to create person-centered neighborhoods (distance and spacetime buffers). GIS Recommendations (cont.) 3. In future studies we can expect that space and time will be increasingly partitioned into smaller collection or analytical units. We already see projects that include the space-time tracking of individuals 24/7 these technologies and research designs will permit better understandings of spatial behavior and in theory permit the use of GIS to contribute to the development of measures of exposure to our defined neighborhoods. 4. Collecting data on spatial behavior will permit researchers to focus on the utilization of specific stores, parks, etc. rather than assume an impact based solely on accessibility/proximity. Moreover, such data will help researchers better understand frequency, duration, and sequencing of food/exercise activities in relation to other activities (and integration of social networks and locations). GIS Recommendations (cont.) 5. Social/health scientists will increasingly try to measure/capture ‘neighborhood’ (and the built and social contexts of individuals) via secondary data sets. Typically these secondary data sets are not collected for the purposes of facilitating health and social science research and there is a need to validate these databases. GIS will have a role to play in these studies, above and beyond concerns over positional accuracy. We need more of this type of fundamental research across diverse contexts. 6. With more detailed spatial data now available on individuals and with the ease of integration of these data with other contextual databases the need for expertise in handling confidentiality and privacy issues will be real. GIS Recommendations (cont.) 7. Unlike the social environment and physical environment there are few national standardized measures of the built environment. We need to find government (and private sector partners ?) to help promote this kind of data as a resource. (many issues of maintenance and currency of the data remain). 8. GIS training opportunities are out there but difficult to find, and few are tailored to current research needs. 9. The largest GIS cost is personnel time (not data, hardware, or software). GIS support personnel and infrastructure between and within college campuses varies enormously. Need to build capacity and teams that can facilitate research using GIS data and spatial analysis methods. GIS Recommendations (cont.) 10. This group should probably try to develop (based on existing materials from GIS and other fields) guidelines vis-à-vis the minimum standards acceptable for the use of specific data sets and guidelines for a minimum checklist of items that ought to be reported to the ‘reader’ in submissions to journals as well as similar lists of GIS-related methodological or data issues to be addressed in crafting grant proposals.