Search This Blog

Tuesday, February 26, 2013

Trailing the Tiger Trade


You can learn how stealth cameras work in the tiger exhibit at the Bronx Zoo. In at least one case, the picture recorded by the stealth cam   sealed the conviction for tiger poachers in Thailand last year.  Just like fingerprints, the patterns of tiger stripes are individually distinctive.  Consequently, the poacher’s picture proved that they did break the law. Thanks to the laws and enforcement that EIA works for, one was sentenced to four years and another to five, the longest such sentence yet for this sort of crime.  Read more about the efforts to save big cats with big data in

Trailing the Tiger Trade With Big Data

A cell phone image of one of the poachers posing with the dead tiger that led to their conviction. Photo courtesy of the WCS Thailand Program.

.The tiger from the cell phone images was identified as the same tiger captured by a camera trap image by WCS the year before, adding to the evidence against the poachers.

A cell p

Tuesday, February 19, 2013

Writer's homework

When I read articles by people who didn't bother to investigate the subject properly, I really wonder, why are they getting paid for this? I just read a Guardian piece on the advantages and drawbacks of  massive open online course AKA MOOCs.  Iit includes an assertion that the courses only can be given for subjects that involve multiple choice tests: 
Moocs are limited to subjects that can be assessed with multiple choice exams, marked automatically.Written any essays in your degree? Your professor's critique of them can't be replicated by a mooc – yet.
First of all, MOOCs like Coursera have come up with a way around that, as I explained in a blog posted last year:
Another innovative aspect to Coursera is the way it assesses student work in courses that are not limited to technology or mathematics.
As founder Ng observes, "Multiple choice doesn't really work for a poetry class." Also, with thousands enrolled in a single class, instructors would find it impossible to personalize responses to student work.
Coursera's solution to that problem is the introduction of "a system for peer grading, in which students will be trained to evaluate each other's work based on a grading rubric provided by the professor." This is not all that different from peer reviews encouraged in writers' groups, which some teachers employ in their own classroom, though the Coursera system is designed to ascertain that the students comprehend the instructor's standards before being allowed to grade another's work.

 Second of all, there are already some systems to automate grading for written work as I explained here:

For example, Pearson’s Write to Learn is designed to offer instant feedback and personalized direction on student writing. The software can be accessed at computers in the school or through an Internet connection remotely. Teachers using the software are happy to have much of the grunt work associated with guiding students through revision and editing lifted from their shoulders.  The automated critique also reduces personal confrontations. As one teacher featured in a Write to Learn case study says, there's no "evil professor" who delights in finding fault in student work.
Educational Testing Service's e-rater is another automated assessment tool. It can score 16,000 essays in 20 seconds, a breathtaking rate of productivity when compared to the one to two minutes per essay typically allotted to human scorers.
Students responded positively when the New Jersey Institute of Technology introduced e-raters. An assistant professor there, Andrew Klobucar, observes that whereas students see drafting and revising multiple times as "corrective, even punitive," when assigned by evil professors, they do not have the same negative view when doing it for an e-rater.

I do wish writers would do their own homework when offering an opinion on the current state of educational technology.

Wednesday, February 13, 2013

The Big Bow-Wow & a Bit of Ivory

Sir Walter Scott contrasted his style of writing with that of Jane Austen: "The big Bow-Wow strain I can do myself like any now going; but the exquisite touch which renders ordinary commonplace things and characters interesting from the truth of the description and the sentiment is denied to me". While he characterized his work as large, Jane Austen called her own small, a "little bit (two inches wide) of ivory on which I work with so fine a brush."

The two are married together, so to speak, by Mathew Jockers, who declares them the literary equivalent of Homo erectus, or, if you prefer, Adam and Eve,"

Read more about the humanities going Google, as one article put it in my Big Data Republic blog, The Big Bow-Wow & a Bit of Ivory

DNA data storage

The latest developments in data storage turn to biology. DNA sequencing, not only allows for big data to be stored for thousands of years, but allows for a more compact encoding system based on the 4 letters in the DNA bases rather than than binary zeros and ones that have been used until now. Still, the cost makes it prohibitive for now, and the fact that the sequencing make the data impossible to update within DNA make it not quite the same as a flash drive.  Read about it in  Big Data: Tiny Storage

Friday, February 1, 2013

Smartphone signals for retail analytics


Shopping online offers customers convenience and price transparency, but it offers retailers even more. As Amazon has demonstrated in its successful model, the information it derives from its customer behavior online gives it insight that it uses to tailor its marketing to the individual. As your online browsing tracks, not only what you buy, but what you considered buying, the retailer gets to learn a lot more about you than the person who rings up your purchase at a store. How can a bricks-and-mortar establishment compete with that kind of analytic edge?

Post-Sandy Street Views

One thing about big data: it is not static. As there is always changes in a situation, what reflected reality one day can be out of date the next. That is especially true when a hurricane of the likes of Sandy sweeps through and alters the landscape and the structures built on it. 

But not everyone is pleased about updates that include images of their hurricane-ravaged homes. Read more at New Maps After Sandy

Thursday, January 24, 2013

That's big data entertainment!

In the past, the device held by someone watching television was usually a remote control. In the future, it is just as likely to be a mobile device. Starting next fall, television ratings will be measured in tweets as well as Nielsen numbers as social conversations are analyzed to calculate the reach of a program. Nielsen and Twitter announced the new form rating last month, but their partnership has been in the works for over a year. 

Snoopy was prophetic. 3-D movies have made a comeback. Consequently, movies today pack a lot more data per frame. But big data is also involved in the trend toward data streaming that is displacing discs and in the data on what people want to watch. 


Big data drives today’s movie industry, both in terms of the amount of data packed into each frame you see at theaters and in terms of video streaming online.  It’s what delivers 3-D effects in the theater and personalized recommendations to Netflix viewers.  And very big numbers ride on both.

In the past few years, 3-D movies have staged a comeback on a scale much greater than their  brief heyday in the 1950s. Adding in the 3-D effect adds "anywhere from 100% to 200% more data per frame," according to Jeff Denworth, vice president of marketing for DataDirect Networks (DDN). Denworth attributes the proliferation of present-day 3-D films to the huge success James Cameron had with the 3-D film "Avatar" in 2009, which packed a petabyte of data. 

"Avatar" cost about $237 million to produce, but it brought in more than ten times that amount. It earned the distinction of   IMDB identifies it as "the highest-grossing film of all time."  By the beginning of 2010, it had taken in $2,779,404,183.  A rash of 3-D films followed this success, and many did very well. According to iSuppli Market Intelligence (owned by IHS)  in 2011 3-D films brought in $7 billion at the box-office, 16 percent more than the previous year.


 The full figures for 2012 are not yet in, though they will likely be higher as the number of 3-D screens have gone up from about 9,000 in "2009 to 43,000 in by the third quarter" of 2012. One of the biggest draws of the year, Marvel’s 3D superhero flick, "The Avengers," grossed  $1,511,757,910 in 2012.  As 3-D has grown so common at the theater, movie-makers have to point to something else to distinguish their offering.


 "The Hobbit: An Unexpected Journey" had to do 3-D one better with its "brand new format High Frame Rate 3D (HFR 3D)." Instead of the 24 frames per second, which is the movie standard, it packs in 48. The advantage to the viewer, it claims, is that the greater number offers an experience "closer to what the human eye actually sees." Perhaps so, but quite a number of viewers were less than thrilled by the effect. Nevertheless, by December 29, 2012, "The Hobbit" had already taken in $600,508,000, according to IMDB figures.  


Big data is also the key to watching movies on the small screen. Instead of picking up a disc when they buy or rent a movie, people now can just have it come right to them. As Dan Cryan, senior principal analyst at HIS  observed, in 2012 Americans made  "a historic switch to Internet-based consumption, setting the stage for a worldwide migration from physical to online." 


 Estimates of  online movie payments for the US in 2012 are “3.4 billion views or transactions, up from 1.4 billion in 2011.  This form of video streaming is dominated by Netflix in the US, where it makes up "33% of peak period downstream traffic"  Amazon, Hulu, and HBO Go follow far behind at 1.8%, 1.5%, and .5% respectively.  It intends to keep its lead with the help of big data.


Netflix was the subject of a WSJ blog on using big data to improve streaming video.  Though Netflix still offers to mail out the DVDs people select for rental, more customers now opt for streaming.  In the interest of improving efficiency on that end, Netflix transferred its holdings to Amazon’s cloud. It also started using Hadoop, which enables it "to run massive data analyses, such as graphing traffic patterns for every type of device across multiple markets." That helps plan for improved data transmission and better understanding of the customer.


In addition to using big data solutions for delivery of content, Netflix applies algorithms to predict what their customers would likely want to watch next.  This type of data mining technology makes Netflix confident that it can handle hosting original content. In fact, it bet more than  $100 million on it; that’s the reported sum paid for the rights to two seasons of House of Cards, one of several original content series it plans on streaming


 As Netflix’s Chief Communications Officer, Jonathan Friedland, says, "We know what people watch on Netflix and we’re able with a high degree of confidence to understand how big a likely audience is for a given show based on people’s viewing habits." 


So what do you think? Is it possible to guarantee a hit with big data?


Thursday, January 17, 2013

Seeing stones for military, rescue, and security operations


What do JRR Tolkien, JP Morgan Chase, the military, and rescue workers have in common? Palantir.
"The Palantír" is the title of the 11th chapter of Tolkien’s The Two Towers. The name refers to the "seeing stones" that allow one to view what is happening elsewhere. In 2004, the name was also taken on by a company that develops software organization to extract meaning from various streams of data to combat terrorism, fraud, and disaster damage.


Palantir distinguishes its approach from data mining by calling it "data surfacing." Read more at 

From Sorcery to Surfacing Data


For more on big data used by the army, see  

National Safety in Big Numbers

 "You can't have a data Tower of Babel" in which each system keeps its data isolated from other systems, Patrick Dreher, a senior technical director at DRC, told Military Information Technology.His company worked with the US Army on the Rainmaker cloud-based intelligence system, which integrates different data models used by the intelligence community. "For example, when Afghan drug lords finance Taliban insurgents, data from one database can be combined with Taliban financing data from an Army database inside the cloud, allowing analysts to make timely, critical connections and stay one step ahead of insurgents."

Thursday, January 10, 2013

Big Data on the Final Frontier


Missions in space may come and go, but the National Aeronautics and Space Administration has always stuck to a mission of bringing in data.

(
One of its early achievements in this field was sending a spacecraft close enough to Venus to get accurate readings of its surface and atmosphere. On Dec. 14, 1962, the Mariner 2 spacecraft got within 34,762km (21,600 miles) of the planet. Over a 42-minute period, it was able to pick up many points of data that proved Venus, which had been thought of as Earth's twin, would be uninhabitable, with a surface temperature of 425°C (797°F) and a toxic atmosphere.
This picture (from NASA's site) of the data gathered in that mission is cropped. The paper showing the data that was gathered is actually much longer, as this uncropped version shows.

Back then, the data covered a roll of paper, but the data NASA handles today takes supercomputing power to process. As Nick Skytland wrote in NASA blog post in October:
In the time it took you to read this sentence, NASA gathered approximately 1.73 gigabytes of data from our nearly 100 currently active missions! We do this every hour, every day, every year -- and the collection rate is growing exponentially...
In our current missions, data is transferred with radio frequency, which is relatively slow. In the future, NASA will employ technology such as optical (laser) communication to increase the download and mean a 1000x increase in the volume of data. This is much more then we can handle today and this is what we are starting to prepare for now. We are planning missions today that will easily stream more 
[than] 24TB's a day. That's roughly 2.4 times the entire Library of Congress -- EVERY DAY. For one mission.
read more at 

Big Data on the Final Frontier

Sunday, January 6, 2013

Is your face your calling card?


Many books include pictures of the author on the back cover or inside the jacket. That is one thing I never bother to check when considering whether or not I want to read a book.  I  still don't really think about the author's appearance as I read. And I don't really think about my own as I write. 

I use a quill for my signature picture here, as well as on my other blogs. It also serves as  my profile photo  on Facebook, Google+ and Twitter. I feel it conveys what I am about more accurately -- in terms of my role as writer - than my photo would. Or maybe I'm just camera-shy.

On  the other hand, my actual photo does serve as my profile picture for the UBM boards on which I write. The policy there, as it is for many newspapers, is to require a photo for the writers. Those who comment only and don't blog can get away with using any picture they like for their profile photo or just use the default picture if they don't bother to upload one of their own. 


Once I had my picture posted in that way, I put it in for my LinkedIn profile, as well. It seemed more consistent to have the same picture represent me there. Also the more standard practice on LI is to use an actual photo than a representational picture.  I still can't see attaching a photo to a resume, though anyone who wishes to find my photo simply has to do an online search to find one in a fraction of a second.


While the net dooes tend to attach author faces to content,  I don't believe I am more drawn to articles that feature faces.I must be  in the minority, though, because I'm certain that those who demand faces find that they are effective at drawing more audience interest. 


. What do you think about  the face as calling card?

Thursday, December 20, 2012

Inhalers that do more than dispense medication

Louisville, one of IBM's 100 selected selected cities is putting big data to work to track asthma triggers with Asthmapolis. Read about it in

Big Data's Next Target: Asthma

  

Friday, December 14, 2012

Big Data Health Hazards


If anything can go wrong, it will." Murphy's Law (or Sod's Law, as it is known in the UK) applies to big data projects, as well. When those projects concern someone's health, something going wrong in the data can lead to something going very wrong with the patient.
The more one relies on the accuracy of the system, the higher the potential for error. Electronic health records (EHRs) are considered a boon to data aggregation, but they hold a potential downside.  Read more here

Tuesday, November 27, 2012

Big Data Applied to Health

I've written several pieces on the topic from various angles:

On how cell phone data is used to map the spread of Malaria in order to come up with effective prevention in Africa Analyzing Cellphone Data for the Greater Good

On Retrofit's approach: Data Gets Personal to Fight Obesity

On UPMC $10 million dollar big data plan: Creating Custom-Fit Healthcare