Loading and validating Oracle spatial data
Transcription
Loading and validating Oracle spatial data
OUGF 03.11.2011 Loading and Validating data in Oracle Spatial Tomi Salmi Content • • • • • • • • • Few words about Karttakeskus and presenter What is Oracle Spatial? SDO_GEOMETRY and the bits and pieces Loading GIS data to Oracle Validating loaded spatial data Fixing invalid data Tools available (SQL Developer etc.) Pitfalls and best practices Questions? Tomi Salmi • • • • • Oracle 9i and 10g OCP DBA 10 years experience in Oracle Spatial and PL/SQL Project Manager Technical Architect Database Specialist All geospatial information services in one place • Dedicated technology independent IThouse • Highly skilled trainer and consultant • Finland's leading GIS outsourcing services provider • More than 90 years a unique expertise in spatial data • All combined with strong consumer view 11/4/2011 Karttakeskus 4 Karttakeskus Oy part of Affecto Group • Finland's largest only GIS focused company • Incorporated 1.1. 2011 • Turnover of 11M € • Staffed by 80 GIS professionals • Affecto's 100%-owned subsidiary • Affecto Group is the Nordic region's leading EIM solution provider – Turnover of 114 M € – and employs 947 the end of 2010 11/4/2011 Karttakeskus 5 What is Oracle Spatial? • Extension available to Oracle Database Enterprise Edition • Limited free version ”Oracle Locator” is included in all database editions • SDO_GEOMETRY-datatype • Functions and operators for data queries and data handling • EPSG Coordinate systems • Routing and network calculations • GeoRaster • Geocoding • Web services • 3D data in Oracle 11g History of Oracle Spatial SDO_GEOMETRY • Geometry is saved as SDO_GEOMETRY-object type, where everything about geometry is in one column • SDO_GEOMETRY is a object type: CREATE TYPE SDO_GEOMETRY AS OBJECT (SDO_GTYPE NUMBER, SDO_SRID NUMBER, SDO_POINT SDO_POINT_TYPE, SDO_ELEM_INFO MDSYS.SDO_ELEM_INFO_ARRAY, SDO_ORDINATES MDSYS.SDO_ORDINATE_ARRAY ); • SDO_Geometry object has few methods to access its attributes Metadata for Spatial columns • MDSYS.SDO_GEOM_METADATA_TABLE – Table owned by MDSYS-users. Contains metadata of ALL spatial columns of database’s tables – No updates to this table • USER_SDO_GEOM_METADATA – View to user’s tables containing spatial data column – Only view to be used to modify spatial metadata – Doesn’t get updated automatically!!!!! • ALL_SDO_GEOM_METADATA – View containg info for all tables user has atleast select priviledges Coordinate systems • Oracle Spatial has full support for geodetic and projected coordinate systems and transformations between any two of them • Oracle 10g R2 included support for EPSG and corrections for transformations • In Oracle, the coordinate system for spatial data is chosen using approriate SDO_SRID-value (if NULL, planar coordinate system is used) • Table MDSYS.CS_SRS contains information about all available coordinate systems – User can always define own coordinate systems (there are limitations) • MDSYS.SDO_DATUMS has information about datums • MDSYS.SDO_ELLIPSOIDS has information about ellipsoids • MDSYS.SDO_PROJECTIONS has information about projections Tools to load spatial Data into Oracle Database • Oracle Spatial doesn’t provide great tools to load spatial data (and rasters) into database • Many professional GIS-software supports loading data into Oracle Spatial • Oracle Spatial includes free software called shp2sdo ”Shape to SDO” – It can be downloaded from Oracle’s web-pages – Tool processes ESRI Shapefiles to text files, which can be loaded into Oracle spatial using SQL*Loader • But now we have ....... SQL Developer + GeoRaptor Spatial indexes • Like normal queries in Oracle, spatial queries need indexes to be fast • Most of the spatial functions will not work without spatial index • CREATE INDEX TOMI_SPX on TOMI(GEOLOC) INDEXTYPE IS MDSYS.SPATIAL_INDEX; • Spatial metadata must be set before you can create a spatial index • If you want to change coordinate system for the geometries, you have to drop/create the index • Creation of the spatial index takes lot’s of time and space • Oracle has two types of spatial indexes: Quadtree and R-tree Spatial indexes • Oracle9i included new spatial index option R-tree – In R-treeindex, MBR (Minimum Bounding Rectangle) is calculated for every geometry – Index has a tree structure where parent nodes MBR is calculated from child nodes MBR – Creating R-tree indexes is easy, because you don’t have to define any parameters about index structure (like in quadtree) – Indexing 3D geometries is possible Spatial indexes • Oracle’s spatial query is divided into two phases: – Primary filter will use spatial index to gather all candidates for the query’s result – Secundary filter will make detailed spatial calculations to determine the final resultset (slow) Loading and validating Oracle spatial data • Oracle is really picky about spatial data: – Data must be validated and fixed to be topologically correct Otherwise spatial operations and queries can return incorrect data/results • Most important aspect of the validation is the tolerance value stored in spatial metadata table – Using this tolerance, Oracle decides if two adjacent coordinate points are same/different Loading and validating Oracle spatial data • Picture has to objects A and B and distance between them is d. • Tolerance saved in metadata-table for geometry column determines what is a relation between these two objects: – If distance d is less than tolerance, the those objects are adjacent (B resides in A’s edge, e.g objects ”touch”) – If distance d is equal or greater than tolerance, Oracle determines these those objects are disjoint and doesn’t ”touch” each other. Loading and validating Oracle spatial data • Same principle for the tolerance is used when validating relations between coordinate points of a single geometry object: – Is the duplicate points – Does parts of polygon overlap itself (not allowed, different polygon parts can only touch each other) • Tolerance value should be chosen to correlate the accuracy of the data. • For example, if precision of the coordinate values are in one decimal (0.1 , 0.2), tolerance should be half of this -> 0.05 Loading and validating Oracle spatial data • SDO_GEOM.VALIDATE_GEOMETRY_WITH_CONTEXT( geometry IN SDO_GEOMETRY, tolerance IN NUMBER (tai diminfo IN SDO_DIM_ARRAY) ) RETURN VARCHAR2; • SDO_GEOM.VALIDATE_LAYER_WITH_CONTEXT( table_name IN VARCHAR2, column_name IN VARCHAR2, result_table IN VARCHAR2 [, commit_interval IN NUMBER ] ); • These are the most often encountered ORA-errors and their descriptions: – ORA-13356 - Adjacent repeated points in a geometry are redundant – ORA-13349 - Polygon boundary crosses itself (polygoni leikkaa itseään) – ORA-13367 - Wrong rotation for interior/exterior rings How to fix invalid geometries? • Oracle provides few tools to correct invalid geometries – Some of them can change the shape of the geometry • Area etc. can change!!!! • SDO_MIGRATE.TO_CURRENT – Procedure corrects certain errors for all geometries after loading data into database – This procedure has a long history. Oracle uses it when migrating from really old Oracle versions (8i mostly) – Corrects orientation wrong SDO_GTYPE, SRID etc. values – Doesn’t actually correct ”digitization” of the geometry – It’s good practice to use this procedure after loading spatial data from files How to fix invalid geometries? • SDO_UTIL.REMOVE_DUPLICATE_VERTICES – This function removes duplicate (redundant) vertices (points, lines) from a geometry – Returns corrected geometry • SDO_UTIL.RECTIFY_GEOMETRY – This function ”tries to fix it all” • Duplicate vertices • Polygon boundary intersecting itself • Incorrect orientation of exterior or interior rings (or both) of a polygon – Returns corrected geometry • These two functions can be found from Oracle10g upwards • If you have 8i/9i you can try to use SDO_UNION-function DEMO 1. loading SHP-file into oracle with SQL Developer 2. Enter/update metadata 3. Validating data and visualizing found errors 4. Correcting found errors Pitfalls and best practices • Oracle’s polygon node orientation < > OGC standard – Always validate/migrate data after loading it to Oracle • SDO_UNION in special cases when loading SHP-files – GIS-softwares usually handle this correctly – WFS softwares can return geometry from Oracle ”as is” • GIS software’s equal-function doesn’t work • If you mass insert/update spatial data – Drop/create spatial index – Remember validation before creating the index – Remember to check that metadatas boundaries are valid (all geometries actually are inside it) Pitfalls and best practices • If you have HUGE amount of data in same table – Partitioning (spatial indexes support this now) – Order data spatially (or with other key) inside the table If you got interested .... • http://www.spatialdbadvisor.com – Really good webpage by independent Spatial Consultant • Pro Oracle Spatial for Oracle Database 11g – Excellent book about Oracle Spatial. • http://forums.oracle.com/forums/forum.jspa?forumID=76 – OTN Forum’s Spatial-section • https://support.oracle.com – Unfortunately Oracle Spatial has bugs .... • RTFM – Oracle’s own Spatial manuals are really good Thank You !!!! Questions ? tomi.salmi@karttakeskus.fi