![]() |
|
|
|
GIS In Archaeology Lab Exercise 14 During the exercises in Vector and Raster data analysis we investigated the potential relationship between archaeological site locations and various environmental factors: distance to water, agricultural soil class, elevation, and slope. Step 1: Create Study Area Map Step 2: Create Factor Likelihood Scores Agricultural Land Type Distance to River Elevation Slope In this model the land will be classified into 1 ha units (100 X 100 meter cells) and each unit will be scored with 1-3 points for each of the four factors separately. The simplest way to split a region into mutually exclusive regions of a set size is to model the data using grids. In this way each grid cell within the study are can be scored for each factor. The final composite score can be created by simply adding the separate grid factor scores together. Step 3: Set Options for Creating Factor Grids Finally on the Cell size tab select “Same as Layer ‘dem_100m’” as the Analysis cell size. Click the OK button to accept these parameter modifications. The project is now set to analyze grid data using a 1 ha. cell size with the analysis limited to just those cells within the boundary of the survey area. Step 4: Score Area for Agricultural Land Types The result of this process is a raster based grid format that identifies each grid cell as belonging to LType 1 thru 9. For the purposes of this analyses we want to simplify the 9 ltype classes down to 3 classes. Based on the previous analysis land types 1, 2, 3 & 5 have more than the expected number of sites, 4 & 6 have about the number that we would expect, and 7, 8 & 9 have less than the expected number of sites. We will reclassify the data in the following way: From the Spatial Analyst popup menu select the reclassify tool. Select LType_Grid as the Input raster. In the reclassify table use the scores listed above to reclassify each of the LType values to be 1, 2 or 3. Set the Output raster to be called LType_Score. After clicking the OK button a new layer will be added to the map that ranks the land types into the 3 groups specified above. Step 5: Score Area for Distance to Rivers The study area is now overlaid with a grid that calculates the distance from each 1 ha cell to the nearest river. Again we will simplify these values in the following way. Open the Reclassify tool from the Spatial Analyst toolbar. Select River_Dist as the Input raster, and set River_Score as the Output raster. Click the Classify button to bring up the Classification window. Modify the classification Method to be Equal Interval, then modify the number of Classes to be 3, then set the break values to be 500, 1000, and 5000. When you return to the Classify window be sure to set the New values so that 0-500 get a value of 3, 500-1000 a value of 2 and 1000-5000 a value of 1 Step 6: Score Area for Elevation Use the same procedure as for distance to rivers to reclassify the DEM_100M layer by setting 3 breakpoints at 1000, 1300, and 5500, set the Output raster to be Elev_Score. Again be sure to set 3 as the new value for 699-1000 and 1 as the new value for 1300-5000 The resulting map classifies elevations into 3 score groups Step 7: Score Area for Slope Use the Spatial Analyst toolbar to create a slope grid layer from the DEM_100M elevation layer. Use DEM_100M as the input surface, Degrees as the output measurement, Z factor of 1, Output cell size of 100 and create an Output raster called Slope_Grid. Once created use the reclassification tool to reclassify the Slope_Grid data with break points at 5, 10 and 35, set the output layer to be Slope_Score. Use the same procedure as was outlined for distance to rivers to reclassify the DEM_100M layer by setting 3 breakpoints at 1000, 1300, and 5500, set the Output raster to be Elev_Score. Step 8: Create Composite Score In the Raster calculator window first double click on Elev_Score to add it to the formula window Next Click the button, LType_Score, , River_Score, , Slope_Score to produce the following grid calculation equation Click the Evaluate button and the new composite score layer will be created. The new layer will be added to the table of contents with a name similar to Calculation. This calculation layer is only a temporary layer created in the Spatial Analyst temporary working directory. To save this as a permanent layer with a more appropriate name right click on the entry in the table of contents then select Make Permanent from the popup menu, save the layer with the name Total_Score and rename it’s name in the table of contents to be Total Score as well. Step 9: Relcassify Composite Score Right Click on Sites_Training in the table of contents and open the attribute table. Click the Options button and select Add Field. We will Add a new field called Total_Score, of data type Long Integer. Once the field has been created right click on the field name and select Calculate Values. Set the new field equal to [Training_Site_Total_Scores.MEAN]. The click the OK button Right click on Sites_Training in the table of contents and select Joins and Relates -> Remove Joins -> Remove All Joins. This will remove all the extraneous fields from the table view. Right click on Total_Score field in the attribute table and select Summarize. At the Summarize window save the data as Training_Site_Total_Score_Summary and click OK Open the resulting summary table to view the number of sites by Total Score level From the summary table it is clear that the majority of the sites have a composite total score of 9 or greater. Once again the question remains as to whether this is a reflection of sites being preferentially distributed or whether they are simply found in the same proportions as the underlying geographic distribution of the variables. Combining these training site table and the total score table and computing the percentage covered by each yields the following summary table An examination of the table indicates that in those areas with a score of 11 or 12 sites are found in substantially greater frequencies than we would have expected by change. Areas with scores of 8, 9 and 10 have site frequencies slightly lower than what change would have predicted, and areas with scores of 4 thru 7 have substantially fewer sites than would be expected by chance. Using this information we can collapse the 9 individual scores to 3 major probability levels for site occurrence Use the reclassification function to reclassify the Total Score layer and create a new layer called Site_Prob with 3 levels corresponding to the ones listed above. Looking at the resulting map of this operation it is apparent that the majority of the training sites fall in the hi or medium probability zones. The fact that most of the training sites fall in the hi or medium level should not be surprising to us, however. The reason we should not be surprised is that the model was in fact built by first examining the relationship of these training sites to the underlying distribution of our 4 geographic variables. Since the model was built from these data we cannot logically use these same data to validate the model. One way to try to validate the model is by comparing the results to the distribution of sites that were not used in the formulation of the model – for us this would be the 100 sites in the Sites_test layer. Step 10: Test the Model Add a Long Integer field called Site_Prob the Sites_Test attribute table, calculate it’s value to be the same as the mean of the joined table and then summarize the Site_Prob field to produce the resulting table. Looking at the Site_Prob attribute table we see the following distribution of the scores. If we combine the two tables together and calculate percentage values we obtain this distribution From this we can see that although the Hi probability cells cover only 22% of the study area they contain 50% of the test sites. In the case of the Med probability they cover 57% of the study area but only contain 45% of the test sites and for the low probability cells the results are 21% and 5% respectively. Questions to think about
|