CURM Nineteenth Meeting, 1/29/2009
From www.norsemathology.org
Contents |
Agenda
New Business
- Reminder: we need to decide today on a title and an abstract. We actually have a few more days (it's due by January 31), but why wait?
- Title:
- Abstract:
- Let's talk about the very first slide (and maybe the second or third) that we want to produce for our presentation. Dr. Hastings has also passed along some images, that we'll want to consider using in our presentation: http://www.norsemathology.org/longa/images/Hastings/wasps/
- NKU's Celebration of Student Research and Creativity will be held April 7-9, 2009. As in past years, students have an opportunity to show their excellent work in posters, interactive or oral presentations and performances of various kinds. The website http://celebration.nku.edu/ is now ready for registration, as well as providing further information. The deadline for students to submit abstracts to appear in the program is February 18.
- I asked you to take a look at Jon's paper:
- In terms of the non-linear regression:
Problem of the Week
Something interesting
I had the R2 stuff below all laid out to talk about, but then I heard something even more interesting (from Jon): Honey Bees Can Tell The Difference Between Different Numbers At A Glance! So: can wasps count cicadas?
R2 is "unrealistically high" in some cases
| I had this discussion with Jon Hastings, which he found to be interesting -- so I'll pass it along to you. Jon and I were talking about a regression, from his paper Hastings, Jon M., Charles W. Holliday and Joseph R. Coelho. Body size relationship between sphecius speciosus (hymenoptera: crabronidae) and their prey: prey size determines wasp size, Florida Entomologist (December 2008), 91(4), 657-663. They used what I considered an inappropriate linear model (because we're fairly certain that the relationship between RWL and mass is non-linear). In the course of discussing that model we got to talking about the great R2 values we were getting.
One could easily think to oneself that if the R2 is really good, why buck the model? Well, the R2 would be high for a wide range of inappropriate models. The reason why we get such good R2 numbers in this case is because we have three populations, spaced out widely, and we can fit a pretty good model to the separate clouds. Let's see how R2 is determined, to help understand this phenomenon. | ||
|
Residuals clearly show some pattern -- which shouldn't be the case if our model captured the data trend. R2 is equal to one minus the square of the correlation between the actual response value ln(RWL) and the residuals, Looking at the residuals here, what do you think will be the correlation? Really small, right? |
||
Here's some random data that will illustrate something very similar. I generate 100 random normal values for x and for y. Then I do it again, but add 10 to every value; and then I do it once more, adding 20. Here's the code:(setq x (rand.n 100) y (rand.n 100) xx (+ 10 (rand.n 100)) yy (+ 10 (rand.n 100)) xxx (+ 20 (rand.n 100)) yyy (+ 20 (rand.n 100)) x (combine x xx xxx) y (combine y yy yyy) reg (regress (list x) y) ) | |
My Conclusion: it seems as though the residuals in the model for the ln(RWL) versus ln(Mass) also have a little bit of a tilt in them, due to the regression model which is mostly fitting the centers of the groups.
Old Business
- Dr. Hastings has passed along some preliminary work on the materials and methods section, concerning collection of the data: The Cicada-Killer and Cicada Data
