= PostGIS Raster SoC Idea 2012 - Distance Analysis Tools for PostGIS Raster = [[BR]] {{{ #!html Student:  Qing Liu }}} [[BR]] '''Mentor:'' [http://www.cef-cfr.ca/index.php?n=Membres.PierreRacine Pierre Racine] ---- '''Idea''': [[BR]] The idea for this proposed project is to add two spatial analysis functions to PostGIS Raster that implement two main ways of performing distance analysis: Euclidean distance calculation and cost-weighted distance calculation. Euclidean distance function will create a distance surface representing the Euclidean distance from each cell in the source layer to the starting point or the closest source (as designated by user). The basic concepts of the algorithm would be using the center of the source cell to calculate the distance from it to the rest cells in the raster layer or within the user-defined extent. Cost-weighted distance will create a raster layer representing the accumulative cost distance for each cell to the starting point or the closest source (as designated by user). A cost raster layer will be generated using one or several factor layers representing the impedance of passing through each cell according to user’s criteria. User can also put weights on the input factor layers to represent different levels of importance for those factors. Factors such as: elevation, slope, orientation, land use / land cover type; vehicle speed, accessibility; and a binary map layer could also be used as a mask for defining geographic extent or as a filter for the output cost layer. The accumulative cost values will be then assigned to each cell representing the cost per unit distance for moving through this cell. '''GSoC 2012 Proposal: ''' [[BR]] [http://google-melange.appspot.com/gsoc/proposal/review/google/gsoc2012/kya_na/1] [[BR]] ---- = Weekly Reports = {{{ #!html


}}} ---- {{{ #!html
   
back to top
}}} == Week 1 Report == '''What did you get done this week?''' * Compiled PostGIS 2.0 sucessfully. * Loaded raster data into PostgreSQL and practiced some query by finishing the [http://trac.osgeo.org/postgis/wiki/WKTRasterTutorial01 raster/vector tutorial]. * Wrote an analysis of how Euclidean distance and Cost-weighted distance are computed in ArcGIS and GRASS. * Setup wiki page. ''' What do you plan on doing next week?''' * Learn more about PostGIS Raster concepts and functions * Write an analysis of how Euclidean distance and Cost-weighted distance are computed in Oracle * Start to write a proposal on how to adopt the concepts and algorithms in PostGIS. '''Are you blocked on anything?''' * As of now I was not blocked on anything yet, but working in a spatial database is something new and challenging to me. I will need to read and reserch more about it. * However, it took me some time to understand how raster coverage is stored in PostGIS, and how Raster type works. [[BR]] === My Analysis Reports === * [https://docs.google.com/document/d/1G6Ge3239kCBkn5qqpa1_0PNsvM9OWewsbI-2ZtSc3Cc/edit How Euclidean distance and Cost-weighted distance are computed in ArcGIS and GRASS] ---- {{{ #!html
   
back to top
}}} == Week 2 Report == '''What did you get done this week?''' * Read Documents and [http://www.cef-cfr.ca/index.php?n=Membres.PierreRacine Pierre Racine]'s [http://www.cef-cfr.ca/index.php?n=Membres.PierreRacine#documents presentations] on PostGIS Raster; Learned more about the raster data storage and manipulation in PostGIS * Read [http://www.oracle.com/pls/db111/portal.portal_db?frame=&selected=7 Oracle Spatial Documents] on [http://docs.oracle.com/cd/E14072_01/appdev.112/e11827/toc.htm GeoRaster]; Learned about raster data storage in Oracle. ''' What do you plan on doing next week?''' * Learn more about PostGIS Raster development * Write a comparison of raster data storage and manipulation in PostgreSQL and Oracle * Write a proposal on how to adopt the concepts and algorithms of distance calculation in PostGIS. '''Are you blocked on anything?''' * It seems that Oracle Sptial dosenot provide distance analysis [http://docs.oracle.com/cd/B28359_01/appdev.111/b28398/geor_operations.htm#CIHIICDF functions for GeoRaster] data. '''Please let me know if I missed it.'''[[BR]]However, by reading documents of !GeoRaster, I got a better understanding of the raster data storage in spatial database. So I feel it would be helpful for me to write an analysis to compare the concepts of raster data storage and manipulation with PostGIS Raster and Oracle Spatial !GeoRaster. {{{ #!html

Comment from Mentor:

I really think we want to avoid having to produce an intermediate raster of sources. PostGIS is strong on raster/vector interaction and I really don't see why someone would prefer to provide sources as raster. The sources should come from a table of point. Now this raise a certain number of issues:

-What if the table contains more than one point?
-What if some of those points are outside the raster extent?
-Another issue is: What if the raster I want to produce does not fit into the PostgreSQL maximum field size?

These are the kind or difficulties one encounters when working in a database.

Think about large tiled coverage, Delaunay triangulation, aggregate functions and all the combinatory of "small/very large number of point" and "small/very large raster coverage". Ideally we want to provide a solution working for every limit cases.


}}} ---- {{{ #!html
   
back to top
}}} == Week 3 Report == '''What did you get done this week?''' * Wrote a comparison of raster data storage and manipulation in PostgreSQL and Oracle * Wrote a proposal on how to adopt the concepts and algorithms of Euclidean distance calculation in PostGIS. ''' What do you plan on doing next week?''' * Write a proposal on how to adopt the concepts and algorithms of Cost-weighted distance calculation in PostGIS '''Are you blocked on anything?''' * Based on the feed back I got from Pierre last week, I agreed that we want to avoid having to produce an intermediate raster of the source point data. Since PostGIS Raster provide the capability of seamless vector-raster interactions, it is preferable that we expect the input source point data to be a vector layer, which is stored in PostgreSQL as a table of points with geomegry. The concepts of the distance calculation in ArcGIS and GRASS were all based on raster source data (ArcGIS will first rasterize the vector layer if necessary). So I was stucked at this point on how to avoid generating this intermediate raster layer if we want the source data to be vector points. [[BR]] === My Analysis Reports === * [https://docs.google.com/document/d/1-5KorOwkdnR8BUZsDBvj_ez3gXmdEh8WHKtiJW3smrQ/edit Compare Raster Data Storage and Manipulation in PostgreSQL and Oracle GeoRaster] * [https://docs.google.com/document/d/1orjY4PnkFX4Kv2m8bpEpCUyf-fW_-fb0_2V37FhqBO8/edit Generate Euclidean Distance Surface in PostGIS (Proposal) (first working on a single raster row)] {{{ #!html

Comment from Mentor:

- "Algorythm: a. Make an empty raster new_rast, using the same extent as the input vector point data source_point, set pixel value to NoData value." I don't think this is a good idea as you would have no control on the alignment of the raster produced. I want to be able to control the dimensions and how the raster is georeferenced. I think your best friend here is ST_AsRaster() which allows passing a geometry and some raster properties and burns the geometry in the raster. Basically ST_AsDistanceRaster() should not be so different from ST_AsRaster(), the only difference is that instead of burning the geometry in the raster, we compute a distance to the geometry for each pixel. It is very possible in this case that the geometry will be outside the extent of the requester raster. Should we want this to work for lines and polygons?

-You should avoid to do two passes to compute the final pixel values: one to compute the distance in pixel/cell and another to convert into the geographical distance.

-What do ArcGIS and GRASS do when there is more than one point? We need a more detailed approach for this case.

-I want also a kind of approach to the problem of billions of points. Does the knn indexing help us in quickly identifying what is the nearest point to the pixel without rescanning all the point for every pixels.

-In general I don't think just copying what ArcGIS or GRASS used to do is what we want. We want a true raster/vector integrated approach, scalable to billions of points, enabling the production of very large tiled rasters. I know this is a much more difficult problem but we want to end up with a variety of easy to use tools. I think this would be a much greater addition to PostGIS than just converting one raster to another raster. Think about aggregate functions or passing a geometry table name to the function. ST_AsRaster was done to work on only one geometry but how could it work to burn many geometries into one raster? Look at ST_UnionToRaster in the PostGIS raster working specifications. (http://trac.osgeo.org/postgis/wiki/WKTRaster/SpecificationWorking03)

-Look also at this mail: http://postgis.refractions.net/pipermail/postgis-users/2012-April/033875.html it might be a general solution working on small and big raster but might be slower than a specific solution. We could simply implement a bunch of custom function in C...

-Start thinking about the signature of some prototype SQL functions. Think about passing the name of a point table

-I think the constraints should be explicited first in your document so we know in which context we want all this to work.


}}}