B.1. Vector Data

 < Day Day Up > 

What is vector data? This quote is a good description of what vector data is:

Vector: An abstraction of the real world where positional data is represented in the form of coordinates. In vector data, the basic units of spatial information are points, lines, and polygons. Each of these units is composed simply as a series of one or more coordinate points. For example, a line is a collection of related points, and a polygon is a collection of related lines. Vector images are defined mathematically as a series of points joined by lines. Vector-based drawings are resolution independent. This means that they appear at the maximum resolution of the output device, such as a printer or monitor. Each object is self-contained, with properties such as color, shape, outline, size, and position on the screen.

From: http://coris.noaa.gov/glossary/glossary_l_z.html#v

MapServer can access vector file formats and database connections. It can also access raster or image data. This is a summary of the vector and database formats with particular focus on how to use them in a MapServer map file and access them using command-line tools such as ogrinfo.

B.1.1. MapServer and Vector Data Access

MapServer offers two methods for accessing data:


Using built-in, format-specific, data access capabilities

The most basic form of data access uses the built-in capabilities that were linked into MapServer when it was compiled. These capabilities are limited to only a few types of vector data, such as ESRI shapefile, PostGIS, Oracle Spatial, and ArcSDE. The default, built-in format for MapServer is the ESRI shapefile format.


Using the capabilities of third-party data access libraries

One of the most powerful features of MapServer is the ability to use data formats through a pseudo plug-in environment. The most significant third-party library being used is GDAL/OGR. This includes raster (GDAL) and vector (OGR) data.

B.1.1.1 Using OGR

OGR is used behind the scenes by MapServer when requested. MapServer doesn't require OGR in order to run. Some users may never need the additional capabilities OGR offers, but many users find them absolutely necessary. Because MapServer can access data via OGR, you don't have to program specific types of data format support directly into MapServer. Instead, you can make further additions to OGR and then use MapServer. In essence, the background libraries allow MapServer to bring the data into an internal, memory-based format that MapServer can use. For the most part, using OGR-related formats is seamless and intuitive.

B.1.2. Data Format Types

There are three types of data mapping and GIS data formats. Each type is handled differently. Here are the types and some example formats:


File-based

Shapefiles, Microstation Design Files (DGN), GeoTIFF images


Directory-based

ESRI ArcInfo coverages, U.S. Census TIGER


Database connections

PostGIS, ESRI ArcSDE, MySQL

Each type of data is made up of a data source and (one or more) layers. These two definitions apply to MapServer and OGR:


Data source

A group of layers stored in a common repository. This may be a file that handles several layers within it or a folder that has several files.


Layer

A subset of a data source often containing information in one type of vector format (point, line, polygon).

B.1.2.1 File-based data

File-based data consists of one or more files stored in any arbitrary folder. In many cases, a single file is used (e.g., DGN) but ESRI shapefiles, for example, consist of at least three files, each with a different filename extension: SHP, DBF, SHX. In this case all three files are required because they each perform a different task internally.

Filenames usually serve as the data source name and contain layers that may or may not be obvious from the filename. In shapefiles, for example, there is one data source per shapefile and one layer that has the same name as that of the file.

B.1.2.2 Directory-based data

Directory-based data consists of one or more files stored in a particular way within a parent folder. In some cases (e.g., coverages), they may also require additional folders in other locations in the file tree in order to be accessed. The directory itself may be the data source. Different files within the directory often represent the layers of data available.

For example, ESRI ArcInfo Coverages consist of more than one file with an ADF file extension, within a folder. The PAL.ADF file represents the Polygon data. ARC.ADF holds the arc or line string data. The folder holds the data source, and each ADF file is a layer.

B.1.2.3 Database connections

Database connections are similar to file- and directory-based structures in one respect: they provide geographic coordinate data for MapServer to interpret. That may be oversimplifying what is happening inside MapServer, but in essence all you need is access to the coordinates making up the vector datasets.

Database connections provide a stream of coordinate data that is temporarily stored (e.g., in memory) and read by MapServer to create the map. Other attribute or tabular data may also be required, but the focus of this guide is coordinate data.

One important distinction between databases must be made. The databases discussed here are spatial databases, those which can hold geographic data in its own data type. This is opposed to strictly tabular databases that can't hold geographic coordinates in the same way. It is possible to store some very simple coordinate data in regular tables, but for anything but the most simple use, a spatial database is required. There are spatial extensions to many databases (open source and commercial). One of the most robust is the PostGIS extension to the PostgreSQL database. This database not only allows the storage of geographic data, but also allows the manipulation of that data using SQL commands. The other open source database with spatial capabilities is MySQL.

Connections to databases usually consist of the following pieces of connection information:


Host

Directions to the server or computer hosting the database.


Database name

The name of the database you wish to access that is running on the host.


User name/passwords

Access privileges are usually restricted by user.

Some databases (e.g., Oracle) use a name service identifier that includes both the host and database names.


Access to specific pieces of coordinate data usually require:


Table/view name

The name of the table or view holding the coordinate data


Geographic column name

Where geometry or coordinates are stored

     < Day Day Up > 


    Web Mapping
    Web Mapping Illustrated: Using Open Source GIS Toolkits
    ISBN: 0596008651
    EAN: 2147483647
    Year: 2005
    Pages: 138

    flylib.com © 2008-2017.
    If you may any questions please contact us: flylib@qtcs.net