The troubles with studying Facebook profiles at Harvard

Many researchers would like to get their hands on SNS/Facebook profile data but one well-known dataset put together by Harvard researchers has come under fire:

But today the data-sharing venture has collapsed. The Facebook archive is more like plutonium than gold—its contents yanked offline, its future release uncertain, its creators scolded by some scholars for downloading the profiles without students’ knowledge and for failing to protect their privacy. Those students have been identified as Harvard College’s Class of 2009…

The Harvard sociologists argue that the data pulled from students’ Facebook profiles could lead to great scientific benefits, and that substantial efforts have been made to protect the students. Jason Kaufman, the project’s principal investigator and a research fellow at Harvard’s Berkman Center for Internet & Society, points out that data were redacted to minimize the risk of identification. No student seems to have suffered any harm. Mr. Kaufman accuses his critics of acting like “academic paparazzi.”…

The Facebook project began to unravel in 2008, when a privacy scholar at the University of Wisconsin at Milwaukee, Michael Zimmer, showed that the “anonymous” data of Mr. Kaufman and his colleagues could be cracked to identify the source as Harvard undergraduates…

But that boon brings new pitfalls. Researchers must navigate the shifting privacy standards of social networks and their users. And the committees set up to protect research subjects—institutional review boards, or IRB’s—lack experience with Web-based research, Mr. Zimmer says. Most tend to focus on evaluating biomedical studies or traditional, survey-based social science. He has pointed to the Harvard case in urging the federal government to do more to educate IRB’s about Web research.

It sounds like academics, IRBs, and granting agencies still need to figure out acceptable standards for collecting such data. But I’m not surprised that the primary issue that arose had to do with identifying individual users and their profiles as this is a common issue when researchers ask for or collect personal information. Additionally, this dataset intersects with a lot of open concerns about Internet privacy. Perhaps some IRBs could take on the task of leading the way for academics and other researchers who want to get their hands on such data.

It is interesting that these concerns arose because of the growing interest in sharing datasets. The Harvard researchers and IRB allowed the research to take place so I wonder if all of this would have ever happened if the dataset didn’t have to be shared where others could then raise issues.

I understand that the researchers wanted to collect the profiles quietly but why not ask for permission? How many Harvard students would have turned them down? I think most college students are quite aware of what can happen with their profile data and they take care of the issue on the front end by making selections about what they display. The researchers could then offer some protections in terms of anonymity and who would have access to the data. Or what about having interviews with students who would then be asked to load their profile and walk the researcher through what they have put online and why it is there?

Nevada opens path to driverless cars

Even though driverless cars are not a common product yet, Nevada has opened a legal path for driverless cars on the road:

Assembly Bill 511, the first such legislation in the country, allows the state’s Department of Transportation to draw up rules that would authorize driverless cars. The regulations would include safety standards, insurance requirements and testing sites.

A driverless car is defined by the bill as using “artificial intelligence, sensors and global positioning system coordinates to drive itself without the active intervention of a human operator.” That includes technology such as lasers, cameras and radar…

Stanford University robotics professor Sebastian Thrun, a project leader on Google’s effort, said that nearly all driving accidents are due to human error rather than mistakes by machines.

“Do you realize that we could change the capacity of highways by a factor of two or three if we didn’t rely on human precision on staying in the lane but on robotic precision, and thereby drive a little bit closer together on a little bit narrower lanes and do away with all traffic jams on highways,” he said in a speech at the TED 2011 conference this spring.

So how long until this becomes a reality? It seems like we have been hearing about these possibilities for years. Here are a few things that could be holding up the process:

1. The legal side of things. Perhaps Nevada is really a pioneer here and will get the ball rolling.

2. The technology is not quite ready yet. It doesn’t sound like this is the issue.

3. We were waiting for a few companies to really push this. It is interesting that Google seems to be getting a lot of the attention. Obviously, their main business is not driverless cars but they had the resources and interest.

4. The cultural side: are people ready to see driverless cars on the road? Even if they are proven to be safer, will people accept them quickly or will it take some time?

Facebook information and privacy: enticing or overwhelming?

There are a lot of users of Facebook and similar sites. One of the primary concerns of users is privacy: who can see their personal information and how it might be used. Two commentators talk about how users respond to this issue:

Sociologist Nathan Jurgenson has an interesting post about Facebook and his skepticism about proclamations of the end of privacy and anonymity. He deploys the postmodernist/poststructuralist insight that each piece of information shared raises more questions about what hasn’t been said, and thus strategic sharing can create different realms of personal privacy and public mystery.

We know that knowledge, including what we post on social media, indeed follows the logic of the fan dance: we always enact a game of reveal and conceal, never showing too much else we have given it all away. It is better to entice by strategically concealing the right “bits” at the right time. For every status update there is much that is not posted. And we know this. What is hidden entices us.

I think this is missing the point. I feel like I need to use all caps to stress this: LOTS OF PEOPLE DON’T WANT ATTENTION. They don’t want to be enticing. Privacy is not about hiding the truth. It’s about being able to avoid the spotlight…

Social media confronts us with how little control we have over our public identity, which is put into play and reinterpreted and tossed around while we watch—while all the distortions and gossip gets fed back to us by the automated feedback channels. Some people find this thrilling. Others find it terrible. It’s always been true that we don’t control how we are seen, but at least we could control how much we had to know about it. It’s harder now to be aloof, to be less aware of our inevitable performativity. We are forced instead to fight for the integrity of our manufactured personal brand.

Jurgenson seems to be referring to the impression management work done by users who are able to craft their image. Most users know that certain pieces of information can hurt them, such as unpleasant photos, so they don’t include that information. Even more so, users try to present a positive image of themselves with generally happy pictures and an acceptable set of interests and activities. And there is a lot that is hidden: I would guess that a majority of users post pretty infrequently. This impression management, reminiscent of Goffman’s front-stage/back-stage dichotomy, has been well established by researchers.

Rob Horning, responding to Jurgenson, suggests that Facebook exposes “how little control we have over our public identity.” This may be true: even small pieces of information might present problems. Additionally, I think he is right in saying that a lot of users don’t want attention: they simply want a low-maintenance way to connect with current and past friends.

But, I would argue that users have a good amount of control over their “public identity” on the Internet. To start, they don’t have to participate and a sizable minority does not. It seems like the easiest way to lose control over what is available on the Internet is to post it yourself, whether on Facebook or a blog or Twitter feed or somewhere else. Second, even if one does participate, Jurgenson suggests that much still remains hidden. There are few people who are willing to reveal everything and few who actually want to. (I’ve always wondered if Facebook users are mostly annoyed with those people who do seem to present everything, good and bad, through their profiles.) Third, one can be friends who they want, limiting who is going to see and possibly use this information. I think a lot of the genius of Facebook is that users feel like they are in control of these aspects and generally resist efforts that use their information in ways that they may not desire. In the end, there are ways in which one can participate without doing much or exposing much.

Horning’s conclusion is interesting: “It’s harder now to be aloof, to be less aware of our inevitable performativity. We are forced instead to fight for the integrity of our manufactured personal brand.” Perhaps this is the real issue, not privacy: since we know that there are others crafting their personal image, we now have the choice to keep up or not. It is not quite a competition but rather mediated social interaction where we can see how others (and they can see how we) “put ourselves together” online. The SNS realm is now another social realm to worry about and it is hard to get away from: did I post a witty enough comment? Is that picture flattering of me? Should I be Facebook friends with that person I never really talked to? These decisions may be consequential…or they may not.

The (terrible?) world of “professional” Amazon reviewers

A recent study of some of Amazon.com’s top 1000 reviewers has PC Magazine writer John Dvorak questions the validity of their reviews:

In the first academic study of its kind, Trevor Pinch, Cornell University professor of sociology and of science and technology studies, independently surveyed 166 of Amazon’s top 1,000 reviewers, examining everything from demographics to motives. What he discovered was 85 percent of those surveyed had been approached with free merchandise from authors, agents or publishers.

Pinch, who also found the median age range of the reviewers he surveyed was 51 to 60, a surprise said Pinch, because the image of the internet is more of a young person’s thing. Amazon is encouraging reviewers to receive free products through Amazon Vine, an invitation-only program in which the top 1,000 reviewers are offered a catalog of free products to review…

This is the fraud aspect of the process that cannot be tolerated. And now to find out they are in a much older demographic makes me think they are just product hoarders who will say what they need to say to get more products. This conclusion is hinted at by the professor.

I do not like man on the street reviews. I never have, and I’ve always thought they could be easily corrupted by smart public relations folks who have already dove into what they call social media. This includes phony personas on Twitter and Facebook that are used to sway public opinion, shipping free goodies to “influential” bloggers, and things like this Amazon scandal.

Dvorak is not really arguing that reviews are not valuable but rather that because Amazon does not fully disclose how these reviewers operate, customers could be duped. The problem here is trust: Dvorak and others might assume that reviewers are doing this out of the goodness of their hearts but instead they are “professionals.” Instead, these reviewers are being “paid.” This is a classic gatekeepers problem: how do you know that a reviewer is trustworthy and giving unvarnished opinions? There are plenty of critics these days for various media outlets and websites. I suspect many average citizens have to read through multiple reviews from a single critic to see if their thoughts line up with their own or to see if they are consistent.

Of course, Amazon relies on a crowd sourcing approach, just like aggregator websites such as Rotten Tomatoes or Metacritic. Do these top reviewers really sway people’s opinions about products since there are often many others who provide reviews of the same products?

Why not ask Amazon whether critical reviewers have been kicked out of these programs? Dvorak is suggesting that these reviewers would speak positively about products just in order to receive more – couldn’t Amazon fight back against this?

My first thoughts when I saw this study a while back was that how confident could Pinch be about his findings based on 166 reviewers. Why not go for a larger sample out of the 1000 Top Reviewers?

(Side note: at the end, Dvorak applauds Pinch for tackling this topic:

By the way (and off topic), you should read my writings over the past 30 years, because I have been hounding sociologists around the world to begin to study these sorts of computer and Internet activities. Give Professor Pinch an award, will you! Maybe that will encourage more studies.

Maybe so.)

Job for sociology majors: online “community managers”

Rawn Shah, “an expert in collaboration and social computing methodologies within organizations and on the Web,” suggests that sociology (and other social science majors) can fill some tech jobs:

Social science brings much more than just this one technique; it reaches particular personalities who are keenly interested in understanding human relationships in its many forms. By interacting over online environments, we lose some key elements of how we communicate and understand each other, in particular, body language and facial expressions. Instead we have to discover the new ways of understanding communication; rediscover the hidden conversations, feelings and emotions in verbal communication now that non-verbal signals may be missing. This leads to just the kind of personality characteristics and skills that are crucial to Community Manager job roles…

Consider this, jobs that involve relationships with groups of people, whether customers, employees, partners or otherwise, are rapidly on the increase as more companies build online communities and participate in social environments online to interact for business reasons. They aren’t simply looking for people who know how to use Facebook; they need people who can interact well, understand relationships, and in particular understand what is not being said, and sense the feel of others. For businesses, these are the roles that make collaboration on the larger scale across the organization happen.

However, I have yet to see the art and science of managing online communities become a regular aspect of college curriculum. I should say I have tried: in 2006-07 I taught the subject as a guest lecturer at the Eller College of Business at University of Arizona. However, the college did not continue the subject after I was not available. In discussing this point with other Social Business and community thought-leaders, they suggested that it was just too early then. Five years later, the demand is there and we don’t seem to have enough of a supply to fill it.

Certainly any company wants candidates who are well experienced with being Community Managers because the best ones have fine tuned their skills through the many interactions. There are many hard skills that you can definitely apply to the job, but quality comes with experience. Every community is different and hence the job can be very pragmatic and audience specific. However, to get to this point we need community managers who understand and practice the basics first. Let’s really start looking into social sciences as one source of supply for the future of work.

I would suspect that many sociology majors wouldn’t even know that these sorts of jobs are possible. But the description should fit what sociology majors can do: “people who can interact well, understand relationships, and in particular understand what is not being said, and sense the feel of others” and promote collaboration within organizations. These are marketable, practical skills that sociology majors should know something about.

Shah suggests there is a matching problem: sociology students and tech companies need to develop a method by which they can find each other. Perhaps it is because there are not many college courses that make this clear, as Shah suggests, but I wonder if it could also be that sociology as a discipline is behind the curve in tackling the burgeoning tech realm.

The age of “neophilia”

A new book cites a sociologist who says we are in a world of “neophilia”:

We are addicted to new products, say Botsman and Rogers. They cite Colin Campbell, a professor of sociology at the University of York, for the diagnosis – that we suffer from ‘neophilia,’ where novelty seeking is the new phenomenon. “Pre-modern societies tend to be suspicious of the novel. It is a feature of modernity that we are addicted to novelty.”

As a stark example of how obsolescence was built into our minds, the book traces the tale of how GM’s Alfred Sloan launched Chevrolet by convincing his team ‘to restyle the body covering of what was essentially a nine-year-old piece of technology under the banner of product innovation.’ The Chevrolet was a remarkable success and the idea of ‘perceived obsolescence’ and ‘change for change’s sake’ was born, the authors note.

“GM went so far as to define its strategy as choreographed cosmetic ‘upgrades’ to ‘Keep the Consumer Dissatisfied.’ In 1929, Charles Kettering, director of research for Sloan, wrote an article declaring, ‘The key to economic prosperity is the organised creation of dissatisfaction…’”

Of course, this obsolescence means more products are sold. It would be intriguing to be privy to some of the conversations corporations must have about particular products: “do we make it a little cheaper so the consumer has to buy a similar product sooner or do we aim for a higher reliability rating in Consumer Reports“? (Do the reliability rankings in Consumer Reports necessarily correspond with the longevity of products or how long consumers hold on to them?)

The contrast between the pre-modern and modern world is interesting: we moderns are skeptical of tradition and conservatism. Does this mean “neophilia” is a product of the Enlightenment?

Facebook loses users in the US, UK – what does it mean?

Facebook has had a meteoric rise – but there are some signs that the growth is slowing:

Fearing for their privacy or perhaps just bored with using the site, 100,000 Britons are said to have deactivated their accounts last month.

And Facebook fatigue seems to be catching. Six million logged off for good in the U.S. too, figures show.

Worldwide, the rate of growth has slowed for a second month in a row – and as it aims to reach its goal of one billion active users, Facebook is having to rely on developing countries to boost its numbers…

‘By the time Facebook reaches around 50 per cent of the total population in a given country, growth generally slows to a halt,’ [Eric Eldon] explained.

This article is rife with speculation: users could be upset with privacy, people could be fatigued or bored with Facebook, etc. Here are a few of these scenarios with my own thoughts:

1. There are only so many people in the world who will use Facebook anyway. It requires using the Internet consistently, whether this is by computer or some mobile device. While it may be “normal” for the younger generations (though the user rate is not 100% here either), it is used less by older generations (even though there has been growth among these sectors). I wonder what sort of saturation point Facebook itself predicted.

1a. Is it really a big deal if Facebook’s growth is now concentrated in developing countries? Is this really any different than many other American companies?

1b. Perhaps we have entered Facebook’s “mature” stage where they can no longer coast based on word-of-mouth and spectacular growth and now need to fight for new users. How long until we see Facebook TV ads trying to entice new users?

2. The article suggests the novelty of Facebook might be wearing off. Perhaps it doesn’t have enough new features – even though the changes in recent years have induced much hand-wringing, it really hasn’t changed that much. Perhaps it has too many people on there and is no longer exclusive enough – this point was driven home by The Social Network as the Winklevoss’ started with a plan to capitalize on the exclusivity of Harvard.

2a. I wonder if Facebook itself is happy with the progress it has made. On one hand, it could generate a lot of money based on targeted advertising. On the other hand, there is some evidence that Zuckerberg wishes it was much more open than it is now. Even though there are no more networks, many people are still tied to friends and acquaintances and don’t wander too far beyond this. How do you connect these newer users around the world to established users or would this be a no-go among users?

2b. The day-to-day novelty of the product should consist of what one’s friends add to the site. Without interesting status updates, pictures, news, and more, what else draws users? Farmville? Making a “friend” connection is one thing – but this is not too interesting if neither side adds new information. So beyond vanity, how can users be provoked to add more?

3. I don’t really buy the privacy argument. Some people are concerned but they are concerned about privacy in a lot of other places as well. If people were really worried about privacy, there would be a lot of things that they wouldn’t do on the Internet, let alone Facebook.

4. Perhaps some people are interested in the story of Facebook losing steam. After all, a narrative where Facebook keeps rising might not be that interesting. How long until we see more stories about competitors to Facebook, like Twitter in the US, or Orkut elsewhere?

5. These numbers regarding the loss of users have no context: how do they compare to similar figures from previous time periods? Is this an increase in the number of users who have left? Certainly, not all users have continued with Facebook after joining.

Dunbar’s number: 150 friends is our limit

It seems like it is pretty easy to collect hundreds of friends on Facebook. Between people we know from years of schools plus jobs plus other activities, the number can increase quickly. But having a large number of online “friends” goes against Dunbar’s number:

Most of Dunbar’s research has focused on why the GORE-TEX model was a success. That model is based on the idea that human beings can hold only about 150 meaningful relationships in their heads. Dunbar has researched the idea so deeply, the number 150 has been dubbed “Dunbar’s Number.”

Ironically, the term was coined on Facebook, where 150 friends may seem like precious few…

Dunbar has found 150 to be the sweet spot for hunter-gatherer societies all over the world. From the Bushmen of Southern Africa to Native American tribes, a typical community is about 150 people. Amish and Hutterite communities — even most military companies around the world — seem to follow the same rule.

The reason 150 is the optimal number for a community comes from our primate ancestors, Dunbar says. In smaller groups, primates could work together to solve problems and evade predators. Today, 150 seems to be the number at which our brains just max out on memory.

Dunbar goes on to suggest that larger organizations then have to find ways to create smaller groups where people can still maintain connections with others.

I’ve thought for a while that Facebook should move away from saying that all people you are connected to are “friends.” This indicates a closeness that I suspect doesn’t really exist in many of these online relationships. They are probably more like “acquaintances” or “people you have interacted with.” But, imagine what would happen if someone you thought was a friend marks you an acquaintance or vice versa. Additionally, by calling everyone a friend, you are suggesting that you are open to a broader set of relationships and Facebook is interested in bringing more people together. If we wanted to get more sociological, we might call these “strong” and “weak” ties but this seems fairly impersonal.

Figures from a few years ago suggested that people had an average of 120 Facebook friends. This still seems like a lot even as sociological research from 2006 (read the full study here) suggests that Americans have fewer confidants and less contact with existing confidants:

In 1985, the average American had three people in whom to confide matters that were important to them, says a study in today’s American Sociological Review. In 2004, that number dropped to two, and one in four had no close confidants at all…

The percentage of people who confide only in family increased from 57% to 80%, and the number who depend totally on a spouse is up from 5% to 9%, the study found. “If something happens to that spouse or partner, you may have lost your safety net,” Smith-Lovin says.

Here are two tentative hypotheses regarding this data:

1. Younger Facebook users are more likely to have higher numbers of friends. This seems to be driven by being students in high school and college where it is common to friend lots of people across a broad swath of classroom, social activity, and living situations.

2. Older Facebook users are more uncomfortable with the term “friends” to describe online relationships. Of course, as the younger generation ages and is used to such terms, the term will become more normal.

Basic sociological question: “what does civilization as we know it rely on?”

Big questions about society can be great for Introduction to Sociology courses. Here is are the sorts of questions that I think could work quite well:

So, what sort of machines do you need to create an industrial civilization—kind of like the ones we have now, but more sensibly sourced. I remember taking a sociology course years ago where we started out with a similar question, although we conceived the question more broadly—what does civilization as we know it rely on? The answer then (decades ago, before the impact of The Whole Earth Catalog had been felt) was something along the lines of “technology.” But this is a much better question.

If we stuck with the second question here, “what does civilization as we know it rely on?”, I could imagine a class could generate a lot of answers:

1. The Internet. In the vast scope of human history, this may seem silly. But for people raised in the Internet era, it would be pretty hard to imagine life without it.

2. Electricity. This makes all sorts of things possible.

3. The steam engine. This helped give rise to the Industrial Revolution.

And so on. But these are all technological changes that could go back to the plow and the wheel and illustrate the human capacity to create and utilize tools. We just happen to live in an era where such technological change is rapid and our daily lives are full of machines. But what about more cultural or sociological phenomena?

1. Language. The ability to communicate in formalized ways gave rise to oral traditions, writing, etc.

2. Government. This doesn’t necessarily have to mean the big bureaucracies of today that impressed Max Weber. But just a form of ruling or authority that helped bring about communities.

3. Sustained agriculture. This has been the traditional answer to how humans were able to create more complex societies in the Fertile Crescent. This is now being challenged by a new argument based on evidence of early religion in Turkey.

I’ll have to think about using these questions in class. They seem particularly good for helping students consider the basic building blocks of human social life before diving into specific sociological phenomena.

Sociologist says “access to information is a fundamental human right”

A sociologist talks about the importance of citizens accessing information:

Access to information is a fundamental human right and democracy can’t function unless you know what government is doing, Dominique Clement, an associate professor at the University of Alberta, said Monday.

“By denying people access to information, you’re denying a human right and you’re denying them knowledge of how governments work, and ultimately that harms our democracy,” Clement, a sociology professor, said during a Canadian Historical Association panel discussion at the Congress of the Humanities and Social Sciences.

Calling freedom of information law in Canada “draconian,” Clement, who’s filled about 500 information requests throughout his career, said reform needs to happen nationwide in order for those laws to be effective.

He said privacy commissioners in the provinces should become more arm’s length than they are now and should be answerable to the legislative assembly or parliament, not to any premier or prime minister.

I wonder how democratic governments would respond to this argument. I imagine they would support it and then argue that certain information need to be protected because of national security and other reasons. One doesn’t have to be a conspiracy theorist to see that there is quite a bit of public/government information that is not easily accessible. Of course, non-democratic governments may not be too happy with these arguments as restricting information is deemed vital – see Iran’s recent efforts to create a national Intranet.

But this is related to a thought I have had in the past: is Internet access, particularly because of its ability to share and produce information, going to become a human right in the near future? Should rights regarding information apply to all information on the Internet or just “vital information” that citizens might need to participate in the civic realm? What would be the response in Western nations if Internet access was severely limited, even if a case could be made for it (like a threat of attack)?