Discover
/
Article

Data science tackles massive digital output

AUG 18, 2014
Physics Today

New York Times : Because of the ever-increasing amounts of data being generated by the Web, smartphones, and other technologies, data scientists are having to wrangle with the vast output to pare it down and organize it into a usable format. “You spend a lot of your time being a data janitor, before you can get to the cool, sexy things that got you into the field in the first place,” said Matt Mohebbi, a data scientist and cofounder of Iodine, a new health startup. Several companies are writing computer software to automate the data-wrangling process. Among other challenges, the programs must be able to merge many different data formats. In much the same way that spreadsheets revolutionized data analysis in business and finance, machine-learning technology could help free data scientists from the more mundane sorting tasks so they can concentrate on the bigger picture.

Related content
/
Article
The finding that the Saturnian moon may host layers of icy slush instead of a global ocean could change how planetary scientists think about other icy moons as well.
/
Article
/
Article
After a foray into international health and social welfare, she returned to the physical sciences. She is currently at the Moore Foundation.
/
Article
Modeling the shapes of tree branches, neurons, and blood vessels is a thorny problem, but researchers have just discovered that much of the math has already been done.

Get PT in your inbox

pt_newsletter_card_blue.png
PT The Week in Physics

A collection of PT's content from the previous week delivered every Monday.

pt_newsletter_card_darkblue.png
PT New Issue Alert

Be notified about the new issue with links to highlights and the full TOC.

pt_newsletter_card_pink.png
PT Webinars & White Papers

The latest webinars, white papers and other informational resources.

By signing up you agree to allow AIP to send you email newsletters. You further agree to our privacy policy and terms of service.