|
Reviews For Paper
Paper ID |
272 |
Title |
Learning to Discover Social Circles in Ego Networks |
Masked Reviewer ID:
|
Assigned_Reviewer_10
|
Review:
|
|
Question | |
Comments to author(s). First provide a
summary of the paper, and then address the following criteria: Quality,
clarity, originality and significance. (For detailed reviewing
guidelines, see http://nips.cc/PaperInformation/ReviewerInstructions) |
The authors investigate a creation of groups or circles in social networks.
They suggest that groups of friends/family/colleagues/school-mates
etc can be uncovered by an unsupervised process by examining only the
users' ego-network which is the social network reduced on a single
user's set of connections.
Such groups, they claim, are identifiable by both the the local network structure and the users' profile features.
The main point of the paper, as I see it, is that these two types of information must be utilized simultaneously.
The authors propose a statistical model for which they try to fit to the ego network.
Model parameters and clustering are simultaneously optimized to maximize the log likelihood of the evidence (the network).
Let L(Theta,C) be the log likelihood of the evidence given model parameters Theta and clustering C.
An expectation maximization algorithm is proposed which maximizes L over Theta and C alternately.
Maximizing over Theta given C amounts to solving a convex optimization problem.
Alas, finding C given Theta is hard. The authors resort to using general purpose `pseudo-boolean' solvers for this task.
This step is quite problematic from the model prospective. The
authors allow clusters to overlap and be of arbitrary size and number.
It is unclear why the trivial solution assigning each edge (above
probability 1/2) a cluster of size 2 which includes its two end points
is not good.
That will give an optimal solution for C which is uninformative. I
could not understand from the manuscript what prevents this trivial
solution. The authors should explain this.
Experimentally, the authors give very compelling evidence to suport their claims.
Their experimentation and data are thorough and produce, what seems to be, quite impressive results.
Since I am not an expert on this topic, I cannot vouch for the completeness of the references.
|
Please summarize your review in 1-2 sentences |
Given that the technical difficulty described above is resolved I recommend to accept this paper.
|
Masked Reviewer ID:
|
Assigned_Reviewer_4
|
Review:
|
|
Question | |
Comments to author(s). First provide a
summary of the paper, and then address the following criteria: Quality,
clarity, originality and significance. (For detailed reviewing
guidelines, see http://nips.cc/PaperInformation/ReviewerInstructions) |
Summary
Social networks analysis. The paper studies the problem of
automatically discovering users’ social circles from so-called Ego
networks (i.e. the network formed by extraction a node and its
neighbors). Proposes a simple probabilistic model with latent multiple
assignments to "circles". Results are good, method works best in
Facebook data.
COMMENTS:
Significance:
This type of social network analysis may have obvious business
applications (can help to form social circles) but could also be of
importance for better understanding of mechanisms in network science.
Quality:
Simple well motivated model. Use powerful option tools. Could have
been more careful on the complexity issues, and discussion of uniqueness
of the solution. Uses BIC and simple cross validation for model choice
and parameters - could have been much better argued why these choice are
made (including the range of lambdas, is it not data dependent??).
Clarity:
Very clear state of the art, methods and experiments. Machine
learning tools could have been better argued. Conclusions are weak.
Originality:
Moderate, many similar methods. Similar work on "dyadic prediction"
with soft latent variables exist in the literature (Menon&Elkan,
http://arxiv.org/abs/1006.2156)
|
Please summarize your review in 1-2 sentences |
Nice well-written paper on an important
issue in network science. Well-motivated method. Inference and
evaluation could have been improved, as well as results and conclusion
appear weak.
|
Masked Reviewer ID:
|
Assigned_Reviewer_9
|
Review:
|
|
Question | |
Comments to author(s). First provide a
summary of the paper, and then address the following criteria: Quality,
clarity, originality and significance. (For detailed reviewing
guidelines, see http://nips.cc/PaperInformation/ReviewerInstructions) |
Summary:
The authors suggest an
algorithm for automatically detecting social circles within ego networks
of online friends, thereby alleviating the users from manually creating
and updating lists of friends. They suggest to learn weighted
similarity metric for each circle with a simple discriminative model. As
features they convert 2 profile trees (or 2 sets of hash tags) into a
binary feature vector. The paper emphasizes four ways to convert the
profile information into features: between friend and ego / between two
mutual friends combined with or without a compression schema. The
compression schema aggregates that two categories are shared, but
ignores the values, e.g. ignoring the information which affiliation was
shared in the feature vector. For twitter data the compression scheme
just counts the number of hash tags in common. The model accounts for
hard assignments to overlapping and fully including circles. The
inference algorithm is cast as a pseudo boolean optimization problem
(making use of a software package). The model was evaluated on three
data sets against a fair amount of baselines with respect to BER
(Balanced Error Rate) and F1.
Strong Points:
- The authors work on a very interesting problem that is very relevant in practice.
-
The model is simple but intuitive. I like the pre-study on labeled data
from Facebook confirming that a decent fraction of the circles are
fully included in another.
- The evaluation is very thorough. Demonstrating the maturity of the work.
- the write-up is very understandable and pleasant to follow.
Weak Points:
-
The paper puts a lot of emphasis on how tree-structured profiles are
turned into binary feature vectors. This discussion is not very
elaborate, the feature construction seems arbitrary. (see more below)
-
Omitting circles that could not be aligned in the evaluation seems not
right to me. I also got the impression that multiple predicted cycles
can be mapped to one ground truth circle. Choosing B^3 ("B cubed") would
alleviated the problem.
Value:
I deem the paper valuable due to its practical applicability.
Detailed Comments:
-
extraction of Binary features from trees has been well studied, e.g.
Haussler "Convolution kernels on discrete structures", 1999 or
Vishwanathan and Smola "Fast kernels for string and tree matching",
2004.
- Twitter experiment: I can't follow why the sheer number
of hash tags in common should contain any useful information. Could the
authors comment on that?
- Do you still find fully included circles in the twitter data set with the compressed features?
-
Given that all four variants perform similarly on twitter, I am
curious: How would the model perform with only the "edge exists" (i.e.
feature index 0 in Figure 5) would perform? Is the profile info helping
for the twitter at all - or is the nature of hard assignments earning
the performance of 0.7 (1 - BER)?
- Equation 1: p(x,y) = exp{ sum_{x,y} ...) - Are those different x and y on both sides of the equation?
- I assume you Equation 3 is for l_{\Theta^K} as references in Equation 9. Right?
-
Why can't you include all four kinds of features into one feature
vector and let the weights decide which ones are most informative?
|
Please summarize your review in 1-2 sentences |
Relevant problem, simple but intuitive
model. I dislike the emphasis on the feature generation that seems crude
and ad hoc.
|
| |