Discover
/
Article

Data science tackles massive digital output

AUG 18, 2014
Physics Today

New York Times : Because of the ever-increasing amounts of data being generated by the Web, smartphones, and other technologies, data scientists are having to wrangle with the vast output to pare it down and organize it into a usable format. “You spend a lot of your time being a data janitor, before you can get to the cool, sexy things that got you into the field in the first place,” said Matt Mohebbi, a data scientist and cofounder of Iodine, a new health startup. Several companies are writing computer software to automate the data-wrangling process. Among other challenges, the programs must be able to merge many different data formats. In much the same way that spreadsheets revolutionized data analysis in business and finance, machine-learning technology could help free data scientists from the more mundane sorting tasks so they can concentrate on the bigger picture.

Related content
/
Article
/
Article
The availability of free translation software clinched the decision for the new policy. To some researchers, it’s anathema.
/
Article
The Nancy Grace Roman Space Telescope will survey the sky for vestiges of the universe’s expansion.

Get PT in your inbox

pt_newsletter_card_blue.png
PT The Week in Physics

A collection of PT's content from the previous week delivered every Monday.

pt_newsletter_card_darkblue.png
PT New Issue Alert

Be notified about the new issue with links to highlights and the full TOC.

pt_newsletter_card_pink.png
PT Webinars & White Papers

The latest webinars, white papers and other informational resources.

By signing up you agree to allow AIP to send you email newsletters. You further agree to our privacy policy and terms of service.