tag:blogger.com,1999:blog-3467242881996852098.post4822618466215872092..comments2024-01-08T08:39:39.026-08:00Comments on The Open Source Paleontologist: Common Mistakes in Scientific Writing [or, A Pedant's Paradise]Andyhttp://www.blogger.com/profile/16171447306687358664noreply@blogger.comBlogger13125tag:blogger.com,1999:blog-3467242881996852098.post-45721174706042815412011-01-20T03:24:17.554-08:002011-01-20T03:24:17.554-08:00Might seem weird that a linguist is reading this b...Might seem weird that a linguist is reading this blog, but I had to weigh in, because if people who write style guides that admonish the "which/that" difference and discourage "runs" as a verb were serious about language use, then more linguists would have jobs. "Data" is a plural form in latin of "datum," equivalent to the english past participle as in "had given" or "been given." This is largely unimportant, however, because we speak English, not Latin. Non-linguists (or grammarians) are often shocked to learn that the distribution can be bimodal. if you actually look in most dictionaries, they do treat "data" as a mass noun, not a plural, even in scientific writings (See <a href="http://corpus.byu.edu/coca/" rel="nofollow">COCA</a> - since 1990, the ratio between mass-singular/plural has been 7/16 in academic journals). <br /><br />The primary problem is that dictionaries are written by grammarians, who don't like ambiguity and use etymology to dictate usage - this is like calling modern birds dinosaurs because they evolved from forms that people used to designate as "dinosaurs" - the latin pp is different than the English pp. And most style manuals are written by specialists in their field (which is for the best), who don't have the expertise to realize that grammarians are often uninformed.<br /><br />There's a further issue with complex restrictive det-Phrases ("bacteria on teeth is a major problem" vs. "the bacteria on her teeth are a major problem"), because in English sometimes we nominate entire clauses, not just proper nouns, but I think I'll just show myself out...Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-3467242881996852098.post-28879065709273117332011-01-09T08:48:01.029-08:002011-01-09T08:48:01.029-08:00By far the most common:
"X is the most basal...By far the most common:<br /><br />"X is the most basal Y".<br /><br />No terminal taxon in a cladogram is any more "basal" than another—by definition. What people mean is: "X is the sister group of all other Y's". Furthermore, we all know this, and only apply the term "most basal" when we have an imbalanced clade. Nobody calls the Actinopterygii the "most basal" osteichthyans, but they're perfectly happy calling Chondrichthyans "basal gnathostomes".Martin Brazeauhttps://www.blogger.com/profile/11783049769991776341noreply@blogger.comtag:blogger.com,1999:blog-3467242881996852098.post-83400416003653764392010-12-29T10:05:11.640-08:002010-12-29T10:05:11.640-08:00"Mya" (and even lowercase "mya"..."Mya" (and even lowercase "mya") are considered perfectly acceptable abbreviations by the USGS and the like, as they are unambiguous in being dates rather than durations. Ma is more formal, however.Thomas R. Holtz, Jr.http://www.geol.umd.edu/~tholtz/noreply@blogger.comtag:blogger.com,1999:blog-3467242881996852098.post-83502555681109008432010-12-28T10:20:21.521-08:002010-12-28T10:20:21.521-08:00In North American journals we put a comma after e....In North American journals we put a comma after e.g. (e.g.,) but in European journals a comma is not used.Bill Parkerhttps://www.blogger.com/profile/05941940882532354219noreply@blogger.comtag:blogger.com,1999:blog-3467242881996852098.post-80943218878099433622010-12-28T02:25:35.847-08:002010-12-28T02:25:35.847-08:00Useful thread, Andy. I could not agree more about...Useful thread, Andy. I could not agree more about "monophyletic clade" -- it bugs me as much now as it used to when my primary-school friends referred to a "round circle".<br /><br />Tom, I've been using Mya -- is that Just Plain Wrong?<br /><br />Anonymous: while it's true that language changes, that doesn't we have to blindly follow every widespread mistake that gets perpetuated via 4chan and Reddit. I will never write "alot" when I mean "a lot", nor "could care less" for "couldn't care less". Still, it's not clear that singular-data is Just Plain Wrong in the same way as those. I think I'd write "data are" in my own own, but I probably wouldn't correct "data is" if I found it in a manuscript that I was reviewing.<br /><br />On the Upper/Lower vs. Late/Early distinction: I understand it, but I don't see the point. Really, what information would be lost we as a community just dumped Upper/Lower and used Late/Early everywhere?<br /><br />on Since vs. Because: oh, please. The use of "since" in these cases is perfectly clear and unambiguous. Like a lot of the rules in the JVP style-guide, it represents nothing more than someone arbitrary preference. It is a COMPLETE waste of time.Mike Taylorhttps://www.blogger.com/profile/06039663158335543317noreply@blogger.comtag:blogger.com,1999:blog-3467242881996852098.post-154565149095034412010-12-27T06:13:10.562-08:002010-12-27T06:13:10.562-08:00Regarding Lower/Upper and Early/Late, it's als...Regarding Lower/Upper and Early/Late, it's also only proper to capitalize the lithostratigraphic terms (Early and Late) when they refer to real chronological divisions. The Cretaceous period has an Early and Late epoch, but no Middle, whereas you can refer to the Early, Middle, and Late Jurassic. Hence, you can talk about Upper Cretaceous and Lower Cretaceous, but middle Cretaceous has to stay lowercase because it is not formalized.Matt BKhttps://www.blogger.com/profile/12583564428711476409noreply@blogger.comtag:blogger.com,1999:blog-3467242881996852098.post-41759496639366384502010-12-26T21:24:43.611-08:002010-12-26T21:24:43.611-08:00"Comprised of" in the place of "com..."Comprised of" in the place of "composed of."Tor Bertinhttps://www.blogger.com/profile/05243812178214071957noreply@blogger.comtag:blogger.com,1999:blog-3467242881996852098.post-23386640891198150092010-12-26T20:33:17.176-08:002010-12-26T20:33:17.176-08:00@220mya - thanks for the correction. Done.
@Anony...@220mya - thanks for the correction. Done.<br /><br />@Anonymous - This post is primarily concerned with scientific, not popular, communication. In many cases (as outlined in this post, for instance) the predominant usage simply does not belong in a scientific paper. Data vs. datum is a prime example - good technical writing demands precise, correct usage. As 220mya points out, all major dictionaries give "data" as the plural form in scientific usage. (and for the record, I do strive to use the two correctly when speaking, but don't get too upset if members of the public don't)<br /><br />@everyone - thanks for the contributions; keep 'em coming!Andyhttps://www.blogger.com/profile/16171447306687358664noreply@blogger.comtag:blogger.com,1999:blog-3467242881996852098.post-90346082563867911732010-12-26T18:19:18.846-08:002010-12-26T18:19:18.846-08:00At risk of winning a prize for pedantry, here'...At risk of winning a prize for pedantry, here's one that really gets me (originally courtesy of my PhD advisor):<br /><br /><b>'Since' as a synonym for 'because'</b><br />The word 'since' implies relative time as an adverb, preposition, and/or conjunction. However, it should not be used as a conjunction when implying causation (i.e., where you'd use 'because').<br /><br /><i>Incorrect:</i> Since the foramen is absent, we cannot code character 32.<br /><i>Correct 1:</i> Because the foramen is absent, we cannot code character 32.<br /><i>Correct 2:</i> Since the beginning of the Cretaceous, flowering plants have diversified to become a major component of terrestrial ecosystems.220myahttps://www.blogger.com/profile/06403919493457640549noreply@blogger.comtag:blogger.com,1999:blog-3467242881996852098.post-9431102434894570872010-12-26T18:11:36.121-08:002010-12-26T18:11:36.121-08:00Andy - Great post! But I'm afraid you have no...Andy - Great post! But I'm afraid you have not quite got the Lower/Upper and Early/Late thing correct. Indeed, Lower/Upper should be applied to lithologic units, but this is <i>lithostratigraphy</i>, whereas <i>chronostratigraphy</i> is simply the application of <i>geochronology</i> to the geologic timescale.<br /><br />Anonymous - 'data' are in fact plural in English too - just see most of the entries here: <a href="http://dictionary.reference.com/browse/data" rel="nofollow">[Definition of 'Data']</a>. Regardless of its use colloquially, 'data' are most definitely plural in scientific writing; any usage to the contrary is a failure of the author and/or editor.220myahttps://www.blogger.com/profile/06403919493457640549noreply@blogger.comtag:blogger.com,1999:blog-3467242881996852098.post-51097861679824448842010-12-26T14:38:38.351-08:002010-12-26T14:38:38.351-08:00"Data" is the Latin plural. We speak Eng..."Data" is the Latin plural. We speak English, not Latin. If you just say what comes naturally, you'll find yourself using "data" in the singular. I bet you're not 100% consistent on that point anyway. <br /><br />Do you insist on using "opera" as a plural too? It's the Latin plural of "opus" you know. Language evolves. Don't tell someone that they're wrong if they're following the predominant usage.Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-3467242881996852098.post-33920074004070072702010-12-26T14:10:17.509-08:002010-12-26T14:10:17.509-08:00ka, Ma, and Ga are for dates; kyr, Myr, and Gyr ar...ka, Ma, and Ga are for dates; kyr, Myr, and Gyr are for durations. Nothing <b>lasts</b> for 10 Ma, any more than things last for 1492 AD.Thomas R. Holtz, Jr.http://www.geol.umd.edu/~tholtz/noreply@blogger.comtag:blogger.com,1999:blog-3467242881996852098.post-53858749823271584872010-12-26T13:56:33.415-08:002010-12-26T13:56:33.415-08:00Let me beat a dead equid a bit more. Unfortunatel...Let me beat a dead equid a bit more. Unfortunately, the term "middle" is in common usage for both stratigraphic and temporal descriptions. Jim Martin has always encouraged his students to use the term "middle" for stratigraphy, and "medial" for temporal descriptions. I've done this in my publications and it has worked very well.Darrin Pagnacnoreply@blogger.com