I did not have too much difficulty organizing my table or graph. For the table I had sample #, pH as read off a meter, pH as found off pH strips, and the presence or absence of underbrush as the different columns. For the graph of my data I graphed the presence of underbrush as teh dependent variable against the soil pH as read off the pH meter. The graph felt a little funky for me since the presence of underbrush is a categorical variable. I assigned 1 to be presence of underbrush and 0 to be absence of underbrush so the data looked like a Bernoulli distribution to me. When a best fit line was placed in on the scatter plot there appeared to be a slight negative relationship, but when the means of each of the two sites were calculated they were so close together only differing by a value of 0.066 pH that without being able to evaluate the data using p-values, since I can’t assume normal distribution, I have to accept the null hypothesis. This outcome is unexpected as I went in thinking that soil pH was probably an influencing factor on the distribution of the underbrush, but there are definitely other factors that could be the cause of this pattern. If I were to explore this topic further I would look into those other factors like interspecies competition, shading, soil type, and soil moisture.
Category: Post 8: Tables and Graphs
Post 8: Table & Graphs
The data I collected was the number of trees with and without ivy in a 5 m radius. Each block, A through E, had five replicates. It was difficult deciding what sort of statistics to conduct on the data. I wanted to keep it straightforward and focused on the abundance of ivy on trees within the radius and then broaden out to look at each block as a whole. I believe looking at the relative frequency of the presence/absence of ivy will achieve this. Since the blocks vary in size it could potentially be helpful to standardize the frequency somehow between the blocks.
Table 2
Summary of data collected. The first letter delineates the block and the following number the replicate. |
||||||
Point Radius (5m) | # of Trees with Ivy | # of Trees without Ivy | Total # of Trees in Radius | Relative Frequency with Ivy | Relative Frequency without Ivy | |
A.1 | 7 | 4 | 11 | 0.6364 | 0.3636 | |
A.2 | 1 | 5 | 6 | 0.1667 | 0.8333 | |
A.3 | 2 | 3 | 5 | 0.4000 | 0.6000 | |
A.4 | 5 | 1 | 6 | 0.8333 | 0.1667 | |
A.5 | 5 | 5 | 10 | 0.5000 | 0.5000 | |
ATotal | 20 | 18 | 38 | 0.5263 | 0.4737 | |
B.1 | 2 | 9 | 11 | 0.1818 | 0.8182 | |
B.2 | 0 | 8 | 8 | 0.0000 | 1.0000 | |
B.3 | 1 | 3 | 4 | 0.2500 | 0.7500 | |
B.4 | 1 | 5 | 6 | 0.1667 | 0.8333 | |
B.5 | 4 | 4 | 8 | 0.5000 | 0.5000 | |
BTotal | 8 | 29 | 37 | 0.2162 | 0.7838 | |
C.1 | 5 | 3 | 8 | 0.6250 | 0.3750 | |
C.2 | 0 | 1 | 1 | 0.0000 | 1.0000 | |
C.3 | 4 | 3 | 7 | 0.5714 | 0.4286 | |
C.4 | 1 | 5 | 6 | 0.1667 | 0.8333 | |
C.5 | 1 | 4 | 5 | 0.2000 | 0.8000 | |
CTotal | 11 | 16 | 27 | 0.4074 | 0.5926 | |
D.1 | 2 | 4 | 6 | 0.3333 | 0.6667 | |
D.2 | 1 | 5 | 6 | 0.1667 | 0.8333 | |
D.3 | 4 | 4 | 8 | 0.5000 | 0.5000 | |
D.4 | 2 | 4 | 6 | 0.3333 | 0.6667 | |
D.5 | 5 | 3 | 8 | 0.6250 | 0.3750 | |
DTotal | 14 | 20 | 34 | 0.4118 | 0.5882 | |
E.1 | 1 | 4 | 5 | 0.2000 | 0.8000 | |
E.2 | 0 | 3 | 3 | 0.0000 | 1.0000 | |
E.3 | 3 | 5 | 8 | 0.3750 | 0.6250 | |
E.4 | 1 | 5 | 6 | 0.1667 | 0.8333 | |
E.5 | 5 | 2 | 7 | 0.7143 | 0.2857 | |
ETotal | 10 | 19 | 29 | 0.3448 | 0.6552 | |
BlockTotal | 63 | 102 | 165 | 0.3818 | 0.6182 |
Blog Post 8: Tables and Graphs
For my research data, I decided to analyze it using the Kruskal-Wallis statistic test. This allows me to compare the median of my response variable to multiple intervals of my predictor variable. I used Minitab 18 to coordinate the data and calculate the medians, means and standard deviations which I then used to for the graph below.
Fig.1. Average squirrel abundance measured at View Royal Park in relation to the number of domestic dogs present.
The average amount of squirrels was calculated within each interval and plotted here with error bars that were calculated using the individual standard deviation. There is an inverse relationship between the number of dogs present and the number of squirrels observed. The statistical test used to analyze this data was the Kruskal-Wallis test. The P-value was found to be P<0.05, therefore, we can reject the null hypothesis and all the means are different.
Blog Post 8 – Tables and Graphs
I found that summarizing my data into a visually representative table or graph was relatively easy. The table I submitted summarizes the number of individuals (abundance) counted within the two habitat areas within Terwillegar Park. My prediction was that bird abundance would increase with increase in tree/forest cover. When I summarized my data into the table it was noticeable that the highest number of individuals was the forested area. In my paper I plan on creating a visually representative bar graph of the data collected as this will display the difference in abundance easier than in table form.
Table 1: Recorded point count data from Terwillegar Park, Edmonton, Alberta.
Habitat Type | Number of Birds | |
March 9, 2019 | March 12, 2019 | |
Open Area with Scattered Cover | 15 | 8 |
Forest Area
|
53 | 51
|
Blog Post 8
The graph I made shows the frequency between the total number of ash trees and the total number of ash trees infected by emerald ash borer (EAB). Each dot on the graph represents one of the 5, 25X25 foot plots containing ash trees. I included a trendline and r^2 value to assess how strong the correlation, if any, existed between my predictor and response variable. I was surprised to see my data indicates there seems to be no correlation between the two variables (very low r^2 value). In my methodology, I was not able to notice signs of EAB presence with full certainty so I expect the general rate of infestation is higher among all plots. My results underestimated infestation rates because it was limited to what I could see with my eyes (eg. larval galleries, fissures in the bark, woodpecker foraging). D-shaped exit holes are the first sign of EAB presence, however, Burr et. al note this occurs 2 years after trees are already infected. This is to say many of the trees I noted as “not-infected” might indeed be infected. Without the ability to girdle trees at the conservation area I visited, my results are not completely accurate. When I made my observations noting the signs of emerald ash borer infestation in the trees, it appeared that younger saplings were not affected by emerald ash borer even amidst other trees that showed clear signs of infestation (eg. D-shaped emerald ash borer exit holes). After some research, I came across the work of Klooster et. al which noted that by 2009, 99% of ash trees in the Upper Huron area of southeast Michigan had been killed -largely by EAB- but those that remained were younger saplings. In the future, I think it would be beneficial to do the same study but based on trees that were planted at the same time /naturally came into seed around the same time to see if there is a relationship between density of a plot and infestation rate. These areas would be characterized by the absence of younger saplings. Additionally, I would use other methods to ensure better accuracy in EAB detection (eg. tree girdling to expose hidden larval galleries and remote sensing methods to notice early canopy loss of EAB infected trees).
Burr, S. J., McCullough, D. G. and T. M. Poland. 2018. Density of Emerald Ash Borer (Coleoptera: Buprestidae) Adults and Larvae at Three Stages of the Invasion Wave. Environmental Entomology 47. https://doi.org/10.1093/ee/nvx200
Klooster, W. S., Gandhi, K. J. K., Long, L. C., Perry, K. I., Rice, K. B. and D. A. Herms. 2018. Ecological Impacts of Emerald Ash Borer in Forests at the Epicenter of Invasion in North America. Forests 9. https://doi.org/10.3390/f9050250
Post 8: Tables and Graphs
The table I created clearly displays the data collected at each site for my research. The number of alive and dead spruce trees are presented, as well as fallen spruce trees and other vegetation that was present. The most important column in the table is the percent dead trees that I calculated for each site because this is what either confirms or denies my hypothesis. I had no difficulties organizing the data since it was very simple data that was collected. The outcome was not what I expected since I expected Site 4 to have the highest percent of dead trees but instead Site 2, which has sparse tree density, actually had the highest percentage of dead trees present. I believe I got the results I did because site 2 had fewer trees than site 3 and 4, so the ratio between dead and alive trees to the total amount were close together, whereas when you have more trees your percentage is going to be lower. In order to further confirm my hypothesis, I think I need to study more sites in the area to gather more data.
Table 1. Recorded field data from Spruce Beetle Research in Kluane, Yukon. The count of dead and alive trees are stated as well as the percent of dead trees calculated for each research site.
Site – Tree Density | Alive Spruce Tree | Dead Standing Spruce Tree | Fallen Spruce Tree | Other vegetation | Percent Dead Trees |
1 – Low Density | 17 | 1 | 5 | 13 | 35.2 |
2 – Sparse | 16 | 6 | 10 | 5 | 50.00 |
3 – Cluster | 32 | 3 | 4 | 8 | 21.8 |
4 – High Density | 91 | 10 | 26 | 2 | 39.5 |
Post 8: Tables and Graphs
Here is my proposed graph from the data I collected:
I decided to show each tree species at each exposure level rather than creating three graphs of each species. I attempted several formats before concluding that this was the most representable option.
These results slightly follow the predicted pattern, Quercus garryana most abundant in low exposure regions, and Pseudotsuga menziesii present in the highest exposure level. The pattern varies off track slightly, showing that my results may require further interpretation. The Quercus garryana trend does show significantly low values at the highest exposure level. Pseudotsuga menziesii values vary with a parabola trend. The Arbutus menziesii data appear to demonstrate an unexpected trend – low values in extreme high and extreme low exposure levels. This result may indicate that Arbutus menziesii require very specific conditions for growth. Arbutus menziesii values are also relatively low, showing that in both extremes it is not the dominant species.
Further exploration could include analyzing the substrate type and soil cover in each area, to determine if the trends follow the same pattern.
Post 8: Tables and graphs
Overall, I did not have any difficulties aggregating and summarizing my data. I created a table showing the percent soil moisture content for each replicate, and also a mean percent soil moisture content for each location (Douglas fir, sagebrush and cattail). I ran an ANOVA test and t-tests on the data in this table and found that there was a significance between the values, (p < 0.05). The results from the tests did not surprise me, that is because when looking at the data, each location had percent soil moisture values that very rarely overlapped with the other locations.
Additionally, I graphed the mean percent soil moisture for each location in a bar graph. In this graph I used standard deviation error bars which did not overlap, also suggesting that there is a significant difference between the percent soil moisture content at each location.
Given my findings, it would be interesting to look at more locations (with the same Douglas fir, sagebrush, and cattail species),in different cities or even just different areas of Kamloops to see if the soil moisture varies much from what I found in my chosen locations.
Post 8: Tables and Graphs

Above is one of the graphs created from my data. As per my prediction, creosote plants that grew in soils with higher moisture levels tended to be taller than their drier counterparts. The graph was relatively easy to construct and interpret. Further regression analysis is needed to see if the line is truly statistically significant by analyzing the residual plots, but an R^2 value of 0.59 or 59% indicates that it is probable.
Blog Post 8: Tables and Graphs
The figure below (Figure 1) depicts dabbling duck abundance at my selected point count locations (1 through 8) in Colony Farm Regional Park. My research focused on habitat selection by dabbling ducks within constructed drainage channels that varied in percent cover of emergent vegetation. I initially wanted to show the average abundance per point count location, however, the total number of ducks observed during my data collection was lower than I would have hoped. Thus, I used the total number of dabbling ducks observed at each point count location. The low numbers also presented some challenges later on when assessing statistical significance, as it would likely have benefited me to have greater sample sizes to work with to truly assess any preference or habitat selection by the ducks I observed. Summarizing the data I did have, however, was fairly straightforward, and is shown below.
Figure 1. Total number of dabbling ducks counted at point count locations 1 through 8 within drainage channels of varying emergent vegetation percent cover.
During data analysis, I ended up grouping the point count location results into three groups based on emergent vegetation cover: Group 1 = <25% cover, Group 2 = less than or equal to 25% but <50% cover, and Group 3 = greater than or equal to 50% cover. I divided the groups in this manner to see if there proved to be a significant difference between the varying amounts of emergent vegetation cover and how this may have affected abundance.
Since my initial prediction was that duck abundance would increase with increasing emergent vegetation cover in the drainage channels, my results surprised me. I was not expecting so much variation in my results, with the greatest number of ducks observed within the channel that had only 30% emergent vegetation cover. I also discovered that variations in duck abundance between groups were also not statistically significant.
In future, I would conduct this study later into September when more dabbling ducks are present within the park to generate a larger sample size (seasonal effect). With greater resources, I would have also measured other factors within the drainage channels, including water quality parameters (dissolved oxygen, pH, temperature) and perhaps also water depth. With more time, I also would have conducted sampling events during different seasons over multiple years to see if this affected my results. It is clear from relevant literature that I have come across during my research project that understanding habitat selection for species can play an important role in habitat management and conservation of protected areas.