Introduction Methodology Experimentation Conclusion
WebStand: Sociological Analysis of the W3C Standardization Process
XML Warehouse Meets Sociology
F.-X. Dudouet1 I. Manolescu2 B. Nguyen3 P. Senellart2,4
1 2
3 4
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Outline
1 Introduction
Sociological Process Standardization
Case of the World Wide Web
2 Methodology
3 Experimentation
4 Conclusion
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Sociological Process
1 Formulatehypotheses
2 Validate on data
Relevant sociological concepts (individuals, institutions. . . ) Data sources are: existing documents, interviews. . .
3 Conclude and issue new hypotheses
Issue
How to collect and manage large volumes of heterogeneous information?
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Sociological Process
1 Formulate hypotheses
2 Validateon data
Relevant sociological concepts (individuals, institutions. . . ) Data sources are: existing documents, interviews. . .
3 Conclude and issue new hypotheses
Issue
How to collect and manage large volumes of heterogeneous information?
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Sociological Process
1 Formulate hypotheses
2 Validateon data
Relevantsociological concepts(individuals, institutions. . . ) Data sources are: existing documents, interviews. . .
3 Conclude and issue new hypotheses
Issue
How to collect and manage large volumes of heterogeneous information?
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Sociological Process
1 Formulate hypotheses
2 Validateon data
Relevant sociological concepts (individuals, institutions. . . ) Data sourcesare: existing documents, interviews. . .
3 Conclude and issue new hypotheses
Issue
How to collect and manage large volumes of heterogeneous information?
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Sociological Process
1 Formulate hypotheses
2 Validate on data
Relevant sociological concepts (individuals, institutions. . . ) Data sources are: existing documents, interviews. . .
3 Conclude and issuenew hypotheses
Issue
How to collect and manage large volumes of heterogeneous information?
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Sociological Process
1 Formulate hypotheses
2 Validate on data
Relevant sociological concepts (individuals, institutions. . . ) Data sources are: existing documents, interviews. . .
3 Conclude and issue new hypotheses
Issue
How to collect and managelargevolumes ofheterogeneous information?
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Standardization
Standard negociations
=⇒ Importanteconomicandpoliticalimpact
Issue
Who? Why? How?
Example
XQuerystandardization scene [ACI Normes et politiques publiques]
Arena quiteaccessiblevia mailing lists Author’s acquaintancewith the topic Processalmost finished
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Standardization
Standard negociations
=⇒ Importanteconomicandpoliticalimpact Issue
Who? Why? How?
Example
XQuerystandardization scene [ACI Normes et politiques publiques]
Arena quiteaccessiblevia mailing lists Author’s acquaintancewith the topic
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Standardization
Standard negociations
=⇒ Importanteconomicandpoliticalimpact Issue
Who? Why? How?
Example
XQuerystandardization scene [ACI Normes et politiques publiques]
Arena quiteaccessiblevia mailing lists Author’s acquaintancewith the topic Processalmost finished
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Standardization
Standard negociations
=⇒ Importanteconomicandpoliticalimpact Issue
Who? Why? How?
Example
XQuerystandardization scene [ACI Normes et politiques publiques]
Arena quiteaccessiblevia mailing lists Author’s acquaintancewith the topic
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Standardization
Standard negociations
=⇒ Importanteconomicandpoliticalimpact Issue
Who? Why? How?
Example
XQuerystandardization scene [ACI Normes et politiques publiques]
Arena quiteaccessiblevia mailing lists Author’s acquaintancewith the topic Processalmost finished
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Standardization
Standard negociations
=⇒ Importanteconomicandpoliticalimpact Issue
Who? Why? How?
Example
XQuerystandardization scene [ACI Normes et politiques publiques]
Arena quiteaccessiblevia mailing lists Author’s acquaintancewith the topic
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Standardization
Standard negociations
=⇒ Importanteconomicandpoliticalimpact Issue
Who? Why? How?
Example
XQuerystandardization scene [ACI Normes et politiques publiques]
Arena quiteaccessiblevia mailing lists Author’s acquaintancewith the topic Processalmost finished
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Case of the World Wide Web
Inestimablesource of data
Much human activity involveWeb technology But:
Heterogeneityof sources
Not suitedto classical database systems Need ofconceptual models
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Case of the World Wide Web
Inestimablesource of data
Much human activity involveWeb technology But:
Heterogeneityof sources
Not suitedto classical database systems Need ofconceptual models
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Case of the World Wide Web
Inestimablesource of data
Much human activity involveWeb technology But:
Heterogeneityof sources
Not suitedto classical database systems Need ofconceptual models
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Case of the World Wide Web
Inestimablesource of data
Much human activity involveWeb technology But:
Heterogeneityof sources
Not suitedto classical database systems Need ofconceptual models
Introduction Methodology Experimentation Conclusion
Sociological Process Standardization
Case of the World Wide Web
Case of the World Wide Web
Inestimablesource of data
Much human activity involveWeb technology But:
Heterogeneityof sources
Not suitedto classical database systems Need ofconceptual models
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Outline
1 Introduction
2 Methodology
Conceptual process XML Warehousing
Data filtering and enrichment Complementary sociological tools
3 Experimentation
4 Conclusion
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Modelling and analysis process
Modelling the relevantsociological entities(actors, institutions, functions, messages, time)
Designing awarehouse of Web resourcesrelevant to the sociological analysis
Exploitingthe warehouse (feeding the warehouse, issuing queries)
Queriesenableverification of the hypotheses
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Modelling and analysis process
Modelling the relevantsociological entities(actors, institutions, functions, messages, time)
Designing awarehouse of Web resourcesrelevant to the sociological analysis
Exploitingthe warehouse (feeding the warehouse, issuing queries)
Queriesenableverification of the hypotheses
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Modelling and analysis process
Modelling the relevantsociological entities(actors, institutions, functions, messages, time)
Designing awarehouse of Web resourcesrelevant to the sociological analysis
Exploitingthe warehouse (feeding the warehouse, issuing queries)
Queriesenableverification of the hypotheses
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Modelling and analysis process
Modelling the relevantsociological entities(actors, institutions, functions, messages, time)
Designing awarehouse of Web resourcesrelevant to the sociological analysis
Exploitingthe warehouse (feeding the warehouse, issuing queries)
Queriesenableverification of the hypotheses
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Warehouse construction process
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
XML Warehousing
Pros
Semi-structuredinformation Flexibility
Language of the Web
Tree structureof a mailing list Simpleto understand
Q
ueries on XML warehouses:XQueryitself!
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
XML Warehousing
Pros
Semi-structuredinformation Flexibility
Language of the Web
Tree structureof a mailing list Simpleto understand
Q
ueries on XML warehouses:XQueryitself!
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
XML Warehousing
Pros
Semi-structuredinformation Flexibility
Language of the Web
Tree structureof a mailing list Simpleto understand
Q
ueries on XML warehouses:XQueryitself!
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
XML Warehousing
Pros
Semi-structuredinformation Flexibility
Language of the Web
Tree structureof a mailing list Simpleto understand
Q
ueries on XML warehouses:XQueryitself!
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
XML Warehousing
Pros
Semi-structuredinformation Flexibility
Language of the Web
Tree structureof a mailing list Simpleto understand
Q
ueries on XML warehouses:XQueryitself!
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
XML Warehousing
Pros
Semi-structuredinformation Flexibility
Language of the Web
Tree structureof a mailing list Simpleto understand
Q
ueries on XML warehouses:XQueryitself!
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Data filtering and enrichment
Identifyreal-world objectsrepresented in the warehouse First name, last name, institution from e-mails
Identifying institutions participating in the process
Classifythese objects according toapplication-driven criteria
Issue classification queries topopulateinteresting classes (iterative process)
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Data filtering and enrichment
Identifyreal-world objectsrepresented in the warehouse First name, last name, institution from e-mails
Identifying institutions participating in the process
Classifythese objects according toapplication-driven criteria
Issue classification queries topopulateinteresting classes (iterative process)
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Data filtering and enrichment
Identifyreal-world objectsrepresented in the warehouse First name, last name, institution from e-mails
Identifying institutions participating in the process
Classifythese objects according toapplication-driven criteria
Issue classification queries topopulateinteresting classes (iterative process)
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Data filtering and enrichment
Identifyreal-world objectsrepresented in the warehouse First name, last name, institution from e-mails
Identifying institutions participating in the process
Classifythese objects according toapplication-driven criteria
Issue classification queries topopulateinteresting classes (iterative process)
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Complementary sociological tools
Issue
Information on the Web hasholes Missinginformation
Important dimensions (e.g. time)implicitlyornot at all represented
Need tocrossvarious sources to establish information
Tools
Interviews, inside information Human-readable data sources
Statistics tools (social properties and group extraction) Human annotation and validation
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Complementary sociological tools
Issue
Information on the Web hasholes Missinginformation
Important dimensions (e.g. time)implicitlyornot at all represented
Need tocrossvarious sources to establish information
Tools
Interviews, inside information Human-readable data sources
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Complementary sociological tools
Issue
Information on the Web hasholes Missinginformation
Important dimensions (e.g. time)implicitlyornot at all represented
Need tocrossvarious sources to establish information
Tools
Interviews, inside information Human-readable data sources
Statistics tools (social properties and group extraction) Human annotation and validation
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Complementary sociological tools
Issue
Information on the Web hasholes Missinginformation
Important dimensions (e.g. time)implicitlyornot at all represented
Need tocrossvarious sources to establish information
Tools
Interviews, inside information Human-readable data sources
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Complementary sociological tools
Issue
Information on the Web hasholes Missinginformation
Important dimensions (e.g. time)implicitlyornot at all represented
Need tocrossvarious sources to establish information
Tools
Interviews, inside information Human-readable data sources
Statistics tools (social properties and group extraction) Human annotation and validation
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Complementary sociological tools
Issue
Information on the Web hasholes Missinginformation
Important dimensions (e.g. time)implicitlyornot at all represented
Need tocrossvarious sources to establish information
Tools
Interviews, inside information Human-readable data sources
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Complementary sociological tools
Issue
Information on the Web hasholes Missinginformation
Important dimensions (e.g. time)implicitlyornot at all represented
Need tocrossvarious sources to establish information
Tools
Interviews, inside information Human-readable data sources
Statistics tools (social properties and group extraction) Human annotation and validation
Introduction Methodology Experimentation Conclusion
Conceptual process XML Warehousing Data filtering and enrichment Complementary sociological tools
Complementary sociological tools
Issue
Information on the Web hasholes Missinginformation
Important dimensions (e.g. time)implicitlyornot at all represented
Need tocrossvarious sources to establish information
Tools
Interviews, inside information Human-readable data sources
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Outline
1 Introduction
2 Methodology
3 Experimentation Warehouses
Queries and results Sociological interpretation
4 Conclusion
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Message warehouse
[email protected] list.
Data
5626messages 2718threads
Maximumthread depth: 12
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Message warehouse
[email protected] list.
Data
5626messages 2718threads
Maximumthread depth: 12
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Message warehouse
[email protected] list.
Data
5626messages 2718threads
Maximumthread depth: 12
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Message warehouse
[email protected] list.
Data
5626messages 2718threads
Maximumthread depth: 12
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Actors warehouse
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Queries
Extractinstitutions(for human annotation) Extractactors
Classify actors byaffiliation
Classify actors bymultiple affiliation Analyze interactionwithin threads Volume of interactionbyaffiliation profile
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Queries
Extractinstitutions(for human annotation) Extractactors
Classify actors byaffiliation
Classify actors bymultiple affiliation Analyze interactionwithin threads Volume of interactionbyaffiliation profile
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Queries
Extractinstitutions(for human annotation) Extractactors
Classify actors byaffiliation
Classify actors bymultiple affiliation Analyze interactionwithin threads Volume of interactionbyaffiliation profile
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Queries
Extractinstitutions(for human annotation) Extractactors
Classify actors byaffiliation
Classify actors bymultiple affiliation Analyze interactionwithin threads Volume of interactionbyaffiliation profile
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Queries
Extractinstitutions(for human annotation) Extractactors
Classify actors byaffiliation
Classify actors bymultiple affiliation Analyze interactionwithin threads Volume of interactionbyaffiliation profile
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Queries
Extractinstitutions(for human annotation) Extractactors
Classify actors byaffiliation
Classify actors bymultiple affiliation Analyze interactionwithin threads Volume of interactionbyaffiliation profile
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Sample results
Actor repartition and volume of interaction by affiliation profile
Profile # actors # messages
Companies 135 2689
Universities 39 112
Organizations 33 197
Companies & Universities 3 532 Companies & Organizations 22 1052 Universities & Organizations 6 36
Non specified 65 681
Total 303 5299
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Sociological interpretation
Companiesinvolvedin XQuery standardization CompaniesdominateXQuery standardization Key actorstend to havemultiple affiliation Not everybodyparticipate in the sameway;
Company/University participants most visible
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Sociological interpretation
Companiesinvolvedin XQuery standardization CompaniesdominateXQuery standardization Key actorstend to havemultiple affiliation Not everybodyparticipate in the sameway;
Company/University participants most visible
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Sociological interpretation
Companiesinvolvedin XQuery standardization CompaniesdominateXQuery standardization Key actorstend to havemultiple affiliation Not everybodyparticipate in the sameway;
Company/University participants most visible
Introduction Methodology Experimentation Conclusion
Warehouses Queries and results Sociological interpretation
Sociological interpretation
Companiesinvolvedin XQuery standardization CompaniesdominateXQuery standardization Key actorstend to havemultiple affiliation Not everybodyparticipate in the sameway;
Company/University participants most visible
Introduction Methodology Experimentation Conclusion
Summary Perspectives
Outline
1 Introduction
2 Methodology
3 Experimentation
4 Conclusion Summary Perspectives
Introduction Methodology Experimentation Conclusion
Summary Perspectives
Summary
Interdisciplinaryapproach
Use ofsemi-structuredtechnology forsociologicalstudy Built anXML warehousebased on XQuery public W3C information
Preliminary analysisof the warehouse data
Companies seem to befirst in standardization process
Introduction Methodology Experimentation Conclusion
Summary Perspectives
Summary
Interdisciplinaryapproach
Use ofsemi-structuredtechnology forsociologicalstudy Built anXML warehousebased on XQuery public W3C information
Preliminary analysisof the warehouse data
Companies seem to befirst in standardization process
Introduction Methodology Experimentation Conclusion
Summary Perspectives
Summary
Interdisciplinaryapproach
Use ofsemi-structuredtechnology forsociologicalstudy Built anXML warehousebased on XQuery public W3C information
Preliminary analysisof the warehouse data
Companies seem to befirst in standardization process
Introduction Methodology Experimentation Conclusion
Summary Perspectives
Summary
Interdisciplinaryapproach
Use ofsemi-structuredtechnology forsociologicalstudy Built anXML warehousebased on XQuery public W3C information
Preliminary analysisof the warehouse data
Companies seem to befirst in standardization process
Introduction Methodology Experimentation Conclusion
Summary Perspectives
Summary
Interdisciplinaryapproach
Use ofsemi-structuredtechnology forsociologicalstudy Built anXML warehousebased on XQuery public W3C information
Preliminary analysisof the warehouse data
Companies seem to befirst in standardization process
Introduction Methodology Experimentation Conclusion
Summary Perspectives
Generic Framework for the Social Scientist
Introduction Methodology Experimentation Conclusion
Summary Perspectives
Future Work
Textual analysisof message contents (e.g. agree/disagree) Proper management oftemporal dimension
Enrichedactor warehouse with more sources (WWW in particular)
Similar work onlarger/other/privatemailing lists Morecomplexqueries
Introduction Methodology Experimentation Conclusion
Summary Perspectives
Future Work
Textual analysisof message contents (e.g. agree/disagree) Proper management oftemporal dimension
Enrichedactor warehouse with more sources (WWW in particular)
Similar work onlarger/other/privatemailing lists Morecomplexqueries
Introduction Methodology Experimentation Conclusion
Summary Perspectives
Future Work
Textual analysisof message contents (e.g. agree/disagree) Proper management oftemporal dimension
Enrichedactor warehouse with more sources (WWW in particular)
Similar work onlarger/other/privatemailing lists Morecomplexqueries
Introduction Methodology Experimentation Conclusion
Summary Perspectives
Future Work
Textual analysisof message contents (e.g. agree/disagree) Proper management oftemporal dimension
Enrichedactor warehouse with more sources (WWW in particular)
Similar work onlarger/other/privatemailing lists Morecomplexqueries
Introduction Methodology Experimentation Conclusion
Summary Perspectives
Future Work
Textual analysisof message contents (e.g. agree/disagree) Proper management oftemporal dimension
Enrichedactor warehouse with more sources (WWW in particular)
Similar work onlarger/other/privatemailing lists Morecomplexqueries