移动课堂
您的位置:外语教育网 > 英语四六级 > 试题中心 > 六级试题中心 > 正文

2017年6月英语六级仔细阅读2题源文章

2017-06-18 10:56   来源:外语教育网       我要纠错 | 打印 | 收藏 | | |

Data-sharing: Everything on display

· Richard Van Noorden

Nature

Published online

07 August 2013

This article was originally published in the journal Nature

Researchers can get visibility and connections by putting their data online — if they go about it in the right way.

Subject terms: Careers Databases Publishing

Lizzie Wolkovich always felt she ought to make her research data freely available online. “The idea that data should be public has been in the background through my entire career,” she says.

Yet in 2003–09, while she was working on her ecology PhD, there were few incentives for her to share. Sharing would not help her to get grants or publications, and although posting data online was not unheard of, few researchers actually did it, she says. Many preferred to hang on to their hard-won field data, sharing privately if they did so at all.

But after she earned her doctorate, Wolkovich overcame her hesitation, thanks to a combination of helpful colleagues, improved resources and a discernible shift in the research community's attitude. So in 2010, through an online data repository called the Knowledge Network for Biocomplexity, Wolkovich released her doctoral data set — the fruit of thousands of hours spent measuring the diversity of arthropods in 56 experimental soil plots she had set up in the arid scrubscape of southern California. Since then, she has publicized all the data that she has collected, including a meta-analysis of 50 other studies that she examined to see how factors such as rising temperatures affect the life cycles of plants. Wolkovich, now at the University of British Columbia in Vancouver, Canada, says that she herself had never objected to sharing her results — she had just not known how to do so. She likes the fact that her data are now easily accessible to other researchers and anyone else who is interested. “It saves me so much time,” she says.

Wolkovich is one of a number of early-career researchers who are enthusiastically posting their work online. They are publishing what one online-repository founder calls small data — experimental results, data sets, papers, posters and other material from individual research groups — as opposed to the 'big data' spawned by large consortia, which usually employ specialists to plan their data storage and release. The many resources now available give researchers options for where and how to post their data, releasing potentially fruitful data sets that used to be locked up in unpublished paper files, buried in journal-article appendices or hidden away on scientists' hard drives.

Opening up

Open data-sharers are still in the minority in many fields. Although many researchers broadly agree that public access to raw data would accelerate science — because other scientists might be able to make advances not foreseen by the data's producers — most are reluctant to post the results of their own labours online (see Nature 461, 160–163; 2009). When Wolkovich, for instance, went hunting for the data from the 50 studies in her meta-analysis, only 8 data sets were available online, and many of the researchers whom she e-mailed refused to share their work. Forced to extract data from tables or figures in publications, Wolkovich's team could conduct only limited analyses.

Some communities have agreed to share online — geneticists, for example, post DNA sequences at the GenBank repository, and astronomers are accustomed to accessing images of galaxies and stars from, say, the Sloan Digital Sky Survey, a telescope that has observed some 500 million objects — but these remain the exception, not the rule. Historically, scientists have objected to sharing for many reasons: it is a lot of work; until recently, good databases did not exist; grant funders were not pushing for sharing; it has been difficult to agree on standards for formatting data and the contextual information called metadata; and there is no agreed way to assign credit for data.

But the barriers are disappearing, in part because journals and funding agencies worldwide are encouraging scientists to make their data public. Last year, the Royal Society in London said in its report Science as an Open Enterprise that scientists need to “shift away from a research culture where data is viewed as a private preserve”. Funding agencies note that data paid for with public money should be public information, and the scientific community is recognizing that data can now be shared digitally in ways that were not possible before. To match the growing demand, services are springing up to make it easier to publish research products online and enable other researchers to discover and cite them. There are so many, in fact, that choosing where and how to publish data sets and other supplementary material can be confusing (see'Abundant options').

Box 1: Abundant options

2017年6月英语六级仔细阅读2题源文章

“Lots of people are getting into data-hosting, and I think it will be tricky to decide where to put your data,” says Heather Piwowar, who studies data-sharing for the US National Evolutionary Synthesis Center in Durham, North Carolina.

Share and share alike

Although exhortations to share data often concentrate on the moral advantages of sharing, the practice is not purely altruistic. Researchers who share get plenty of personal benefits, including more connections with colleagues, improved visibility and increased citations. The most successful sharers — those whose data are downloaded and cited the most often — get noticed, and their work gets used. For example, one of the most popular data sets on multidisciplinary repository Dryad is about wood density around the world; it has been downloaded 5,700 times. Co-author Amy Zanne, a biologist at George Washington University in Washington DC, thinks that users probably range from climate-change researchers wanting to estimate how much carbon is stored in biomass, to foresters looking for information on different grades of timber. “I would much prefer to have my data used by the maximum number of people to ask their own questions,” she says. “It's important to allow readers and reviewers to see exactly how you arrive at your results. Publishing data and code allows your science to be reproducible.”

Even people whose data are less popular can benefit, adds Piwowar. By making the effort to organize and label files so that others can understand them, scientists become more organized and better disciplined themselves, and can avoid confusion later on. “It is often very hard to find and understand your own work if you are looking at it years from now,” says Piwowar. Scientists might be inclined to stuff their data into folders that can get lost and muddled — but if they store the files in an online repository, they are forced to curate and collate the data, she says.

HEATHER PIWOWAR

Heather Piwowar: “Lots of people are getting into data-hosting, and I think it will be tricky to decide where to put your data.”

The fear of being scooped is a powerful inhibitor. But scientists can put an embargo on their data, so that only they can see the work until they are ready to make it public. And data sets are becoming increasingly citable, bringing their authors formal recognition: data published in a data journal, on Dryad or on the repository figshare.com are given a digital object identifier (DOI) that can be referenced in other publications. (Figshare is owned by Digital Science, a sister company to Nature Publishing Group.)

Would-be sharers often worry that their data are too disordered or shoddy to release into the world. “I make my data available, and it can be a pain. I'm also scared and embarrassed about errors — most of us are, especially early-career scientists,” says Piwowar. “We don't yet have a culture of forgiveness around that, unlike in computer programming, where everyone knows there are bugs in code.” She advises researchers to look into repositories to get a sense of the quality standard for experimental data. “It doesn't have to be perfect,” she says. “It's probably less thorough than you think.”

As sharing grows more common, scientists may worry less about posting data sets. “Ultimately, data will be so ubiquitous that we will no longer be in a world where researchers are so scared,” says Carl Boettiger, an ecologist at the University of California, Santa Cruz, who keeps his entire laboratory notebook open online (see Nature 493, 711; 2013). “At the end of the day, science is a social process. You will never get there hiding yourself and your work,” he adds.

" At the end of the day, science is a social process. You will never get there hiding yourself and your work. "

The right place

Depositing data on a personal website is unlikely to be the best way to get it reused and cited. For a start, the website may not be around in five years, says William Michener, director of e-science initiatives at the University of New Mexico in Albuquerque. Michener is principal investigator for a multinational programme called DataONE, which is funded by the US National Science Foundation and promotes best practices to scientists as part of its aim to make data more discoverable. Journal publishers back up their research papers with the help of non-profit archiving services such as Portico and CLOCKSS, which are financed by participating libraries and publishers, and which store material on a number of servers so that it will not disappear if a publisher goes bankrupt. Some data publishers have similar contingency plans, and Piwowar recommends looking into them. If no back-up plans are in place, she says, “it suggests they haven't prioritized well enough how to steward their data”.

Just as important as sharing data publicly is making sure that other researchers can understand them. Susanna Assunta-Sansone, associate director of the Oxford e-Research Centre at the University of Oxford, UK, says that putting out data without noting what it means will ensure that “it's not really reusable”. To avoid this, researchers must choose appropriate metadata: descriptions of the data's content and how they are arranged and set up. This type of curation is useful not just for human readers, but also for computer programmes that might be used to search through or connect data sets. Intelligent searches often rely on whatever descriptive metadata researchers have attached to the data. The metadata are read by an application programming interface (API), a set of commands that computer programmes use to interact with data stores and pull information from them. Not all data repositories use APIs; those that do not may not be the best places to store or release information, because it could be hard for anyone to find.

Sites that are dedicated to hosting particular types of data, such as DNA sequences, usually tell submitters what format is appropriate. They may require data to be entered using an online form or following specific instructions. By contrast, generalist sites — such as institutional repositories, data journals or ventures similar to figshare.com — may have looser requirements. This has the potential to result in a blizzard of different formats and descriptive tags, which could make discovering and reusing data more difficult, so researchers should pay close attention to the norms in their fields.

Decisions about metadata standards should be made early in a research project, says Michener. DataONE has provided a primer on best practices, as has a tool called DataUp, run through the University of California Curation Center in Oakland to help researchers to create data packages that are good enough to put online. Other aspects of data-sharing to consider early on include the information's sensitivity and whether some parts must be stripped out to avoid, for example, identifying human study participants or the locations of endangered species. Researchers also need to be clear about whether they will allow their data sets to be used for any purpose, or whether they would like to limit reuse to, for example, non-commercial applications. One widely understood way of documenting reuse rights is by giving the data one of several different Creative Commons licences.

Ultimately, says Michener, early-career researchers need to pay attention to new and developing ways to share data, and to the standardized formats that are emerging to make data easier to search and discover. Those who do not, he says, should rethink why they are doing research. “I think we are just now reconnecting with what science is all about — not just creating new knowledge, but also sharing the information and data that underpins those discoveries.”

相关资讯:
网站导航:
 学位英语 指南 动态 经验 试题 资料  托福 指南 动态 考情 留学 复习
 雅思 指南 动态 机经 经验 辅导  公共英语 指南 动态 备考 试题 辅导
 日语 指南 资讯 辅导 留学 考试  法语 发音 词汇 语法 听说 阅读
 韩语 入门 口语 阅读 留学 文化  西语 口语 词汇 阅读 留学 风采

学位英语免费试听

更多>>
  • 四级辅导
  • 六级辅导
全科套餐
280元/门
超值优惠套餐=写作+词汇+听力+阅读+翻译+真题精讲班 70课时
词汇串讲 精讲大纲词汇,轻松记忆单词
课时数:10课时
阅读串讲 紧扣大纲要求,直达阅读高分
课时数:10课时
听力串讲 剖析解题秘笈,提升听力水平
课时数:10课时
写作串讲 解读命题规律,揭秘高分技巧
课时数:10课时
翻译串讲 梳理重要考点,提高应试能力
课时数:约6课时
真题精讲 讲授历年真题,直击命题精髓
课时数:24课时

网校介绍

更多>>

外语教育网(www.for68.com)是北京东大正保科技有限公司(CDEL)旗下一家大型外语远程教育网站,正保科技成立于2005年7月,是国内超大型外语远程教育基地,上榜“北京优质教育资源榜”--“百万读者推崇的网络教育机构”。


公司凭借雄厚的师资力量、先进的网络视频多媒体课件技术、严谨细致的教学作风、灵活多样的教学方式,为学员提供完整、优化的外语课程,既打破了传统面授的诸多限制,发挥了网络教育的优势,也兼顾面授的答疑与互动特点,为我国培养了大量优秀的外语人才。


为了满足学员学习不同语种、不同阶段的学习需求,网站开设了包括考试英语、行业英语、实用口语以及小语种在内的百余门语言学习课程,涵盖英语、日语、韩语、俄语、德语、法语、西班牙语、意大利语、阿拉伯语等主要语种,供学员自由选择。此外,网站还拥有各类外语专业信息和考试信息20余万条,是广大学员了解外语类考试最新政策、动态及参加各语种培训的优质网站。


北京东大正保科技有限公司成立于2000年,是一家具备网络教育资质、经教育部批准开展远程教育的专业公司,为北京市高新技术企业、中国十大教育集团、联合国教科文组织技术与职业教育培训在中国的唯一试点项目。


公司下属13家行业远程教育网站,业务涵盖了会计、法律、医学、建设、自考、成考、考研、中小学、外语、信息技术、汉语言教学等诸多领域,拥有办公面积8000多平米,员工近千人,公司年招生规模达270万人。由于正保远程教育(China Distance Education Holdings Ltd., CDEL)在中国互联网远程教育行业内的绝对优势和强大影响力,正保教育模式一直被广大投资人所追捧。2008年7月30日,公司在美国纽约证券交易所正式挂牌上市(股票交易代码:DL),是2008年唯一一家在美国纽交所上市的专业从事互联网远程教育的中国企业。


精彩推荐

版权声明
   1、凡本网注明 “来源:外语教育网”的所有作品,版权均属外语教育网所有,未经本网授权不得转载、链接、转贴或以其他方式使用;已经本网授权的,应在授权范围内使用,且必须注明“来源:外语教育网”。违反上述声明者,本网将追究其法律责任。
  2、本网部分资料为网上搜集转载,均尽力标明作者和出处。对于本网刊载作品涉及版权等问题的,请作者与本网站联系,本网站核实确认后会尽快予以处理。
  本网转载之作品,并不意味着认同该作品的观点或真实性。如其他媒体、网站或个人转载使用,请与著作权人联系,并自负法律责任。
  3、本网站欢迎积极投稿
  4、联系方式:
编辑信箱:for68@chinaacc.com
电话:010-82319999-2371