Health-related behaviour change through social networking sites: a systematic review and meta-analysis

(1)

Thesis

Reference

Health-related behaviour change through social networking sites: a systematic review and meta-analysis

RUDOLF, Aaron Maria

Abstract

Ce travail analyse l'utilité et l'efficacité des réseaux sociaux pour la promotion de comportements favorables à la santé (une activité physique régulière, par exemple) en la comparant à des interventions dites « standards ». Différentes bases de données ont été parcourues selon une stratégie de recherche pré-spécifiée. Après un processus systématique de sélection, vingt essais randomisés et contrôlés ont été inclus. Deux méta- analyses réalisées démontrent des effets modestes mais statistiquement significatifs en faveur des interventions recourant aux réseaux sociaux pour promouvoir l'activité physique et la gestion du poids corporel. Or, ces résultats sont à considérer avec prudence en raison de limitations inhérentes aux études examinées et interprétées dans ce travail. Afin d'aboutir à une conclusion plus fondée et apte à être transposée à la population générale, les limitations décrites doivent d'abord être solutionnées par les futures recherches.

RUDOLF, Aaron Maria. Health-related behaviour change through social networking sites: a systematic review and meta-analysis. Thèse de doctorat : Univ. Genève, 2020, no. Méd. 10993

DOI : 10.13097/archive-ouverte/unige:137275 URN : urn:nbn:ch:unige-1372751

Available at:

http://archive-ouverte.unige.ch/unige:137275

Disclaimer: layout of this document may differ from the published version.

1 / 1

(2)

Health-related behaviour change through social networking sites:

a systematic review and meta-analysis.

Aaron Maria Rudolf^a, Carlos de Mestral^b, Dusan Petrovic^b, Prof. Idris Guessous^a, Dre Silvia Stringhini^{a, b}.

a Unit of Population Epidemiology, Department of Primary Care, Geneva University Hospitals, Rue Gabrielle-Perret-Gentil 4, 1211 Genève, Switzerland

b University Centre for General Medicine and Public Health, Lausanne University, Route de la Corniche 10, 1010 Lausanne, Switzerland

(3)

2

Table des matières

Introduction (français) ... 3

Résumé du travail (français) ... 8

Conclusion (français) ... 10

Discussion (français) ... 11

1 Abstract ... 17

2 Introduction ... 18

3 Methods... 20

3.1 Definition of social networking sites (SNS)/social media and Web 2.0 ... 22

3.2 Study selection and data extraction ... 22

3.3 Quality assessment ... 24

3.4 Data synthesis ... 24

4 Results ... 26

4.1 Study selection ... 26

4.2 Study characteristics ... 26

4.3 Intervention and adherence/retention rates ... 27

4.4 Quality assessment ... 29

4.5 Comparison and outcomes ... 30

4.6 Effectiveness of social media ... 31

4.7 Meta-analysis ... 32

4.8 Publication bias ... 33

4.9 Narrative approach regarding studies not included in a meta-analysis ... 33

5 Discussion ... 34

5.1 Advantages / drawbacks of Social Networking Sites for health behaviour change 34 5.2 Are RCTs the appropriate study design for this type of research question? ... 36

5.3 Strengths and limitations ... 36

5.4 Implications for future research ... 38

6 Conclusion ... 39

Conflict of interest ... 39

Acknowledgements ... 40

Funding ... 40

Tables and Figures ... 40

References ... 51

Appendix ... 53

(4)

3 Introduction (français)

L’inactivité physique, le tabagisme, l’alimentation malsaine ainsi que la consommation néfaste d’alcool ont été définis par l’Organisation Mondiale de la Santé (OMS) comme étant des facteurs de risque évitables à l’origine de la plupart des maladies non-transmissibles, responsables de changements métaboliques et physiologiques comme l’hypertension, l’obésité, l’hyperglycémie, l’hypercholestérolémie et la mort précoce (World Health Organization WHO, 2019). Or, dans la plupart des pays occidentaux, une grande proportion de la population n’atteint pas les recommandations minimales requises par l’OMS qui consistent soit en 150 minutes d’activité d’endurance d’intensité modérée soit en 75 minutes d’activité d’endurance d’intensité soutenue (World Health Organization WHO, 2015). En Suisse, malgré une myriade d’essais de promotion de la santé par l’Office fédéral de la santé publique et l’Office fédéral de sport, 25% de la population adulte est sédentaire et 42,7% est en surpoids, voire obèse (Pahud and Zufferey, 2019). Au niveau mondial, aujourd’hui, un quart de la population n’est pas suffisamment active physiquement (World Health Organization WHO, 2014, 2018b) et l’obésité a presque triplé depuis 1975 atteignant 1,9 milliards d’adultes en surpoids – dont 650 millions de personnes obèses – en 2016 (World Health Organization WHO, 2018a).

Le tabagisme, quant à lui, est au cœur des stratégies de santé publique depuis plus d’un demi- siècle (Lushniak et al., 2014). Le dernier rapport en la matière montre une réduction du taux global de tabagisme de 22%, ce qui est toutefois encore loin de l’objectif global volontaire établi par l’OMS, à savoir une réduction de 30% entre 2010 et 2025 (World Health Organization WHO, 2018d). Aujourd’hui, le tabagisme tue plus de 8 millions de personnes chaque année, soit de façon directe, soit par l’exposition au tabagisme passif. De même, on estime aujourd’hui que 3 millions de personnes meurent chaque année dans le monde entier à cause d’une consommation excessive d’alcool, soit 5,3% de la totalité des décès annuels, un nombre qui s’élève à 13,5% pour la tranche d’âge des 20-39 ans (World Health Organization

(5)

4

WHO, 2018c; World Health Organization, 2019). Ainsi, les comportements néfastes à la santé sont toujours très répandus dans le monde et génèrent des coûts importants en lien avec le traitement des maladies non-transmissibles qui en résultent et/ou la perte conséquente de capital humain(World Health Organization, 2019). Les coûts totaux de santé augmentent en moyenne plus vite que le produit intérieur brut, aussi bien dans les pays occidentaux (4% en moyenne) que dans les pays en voie de développement (proche de 6%) (Xu et al., 2018). En Suisse, les coûts totaux des assurances maladies ont augmenté en moyenne de 3,8% entre 2013 et 2017 et les coûts totaux de santé s’élevaient à 82,5 milliards de francs suisses en 2017 (Pahud and Zufferey, 2019). Le rapport coûts de santé/produit intérieur brut a atteint 12,3% en 2017 (Bundesamt für Statistik BFS, 2019) mettant la Suisse au deuxième rang derrière les États- Unis sur l’échelle des dépenses de santé publique des pays issus de l'Organisation de Coopération et de Développement Économiques (OCDE).

Illustration 1 coûts liés à la santé dans des pays de l’OCDE en 2017. Source : FSO 2019.

Afin de renforcer les efforts au niveau national pour s’occuper de la charge liée aux maladies non-transmissibles comme l’hypertension, l’obésité, l’hyperglycémie, l’hypercholestérolémie Health expenditure in OECD countries, 2017^p

Health expenditure in relation to GDP

OECD – Health Statistics 2019

0% 5% 10% 15% 20%

SwitzerlandUSA France Germany Sweden Canada Norway Austria The Netherlands Belgium United Kingdom Finland Italy Spain Iceland Ireland

12,3 17,2 11,311,5 10,410,9 10,310,4 10,010,1 9,6

7,1 8,5

p provisional data

8,99,2 8,8

(6)

5

et la mort précoce, l’OMS a élaboré un plan d’action global 2013-2025 dont l’objectif est la prévention et le contrôle des facteurs de risque les plus pertinents incluant les comportements néfastes pour la santé décrits ci-dessus (World Health Organization WHO, 2013). Pour atteindre cet objectif, l’OMS a appelé ses États membres à formuler, implémenter, monitorer et évaluer des stratégies publiques appropriées. Elle demande que les gouvernements se responsabilisent afin de faciliter la promotion des comportements favorables à la santé sur l’agenda politique grâce à l’élaboration de stratégies nationales ainsi que de plans d’action nationaux (World Health Organization WHO, 2014).

Alors que des stratégies et structures nationales sont certainement nécessaires pour atteindre les objectifs de l’OMS, le progrès dans le domaine technologique pourrait être, quant à lui, à l’origine d’instruments inédits de prévention au niveau individuel. Ainsi, le champ de recherche des interventions liées aux changements comportementaux (behavior change interventions, angl.) a récemment été enrichi d’un nouveau sous-champ dit d’interventions digitales liées aux changements comportementaux (digital behavior change intervention, DBCI) qui est né du progrès fulminant des technologies de l’information et de la communication. Yardley et al. (2016) définissent les DBCIs comme étant des interventions qui utilisent la technologie digitale pour promouvoir et maintenir la santé par prévention primaire ou secondaire. Les DBCIs ouvrent de nouvelles opportunités pour faire effet sur la vie des gens et sur leurs habitudes quotidiennes au vu de l’influence croissante d’Internet et de ses réseaux sociaux. Selon des estimations récentes de Hootsuite et al. (2019), 58% de la population mondiale est connectée à Internet et 45% utilise activement les réseaux sociaux. Or, des différences régionales importantes existent : en Afrique, le taux de pénétration d’Internet est de 36% et le taux d’utilisateurs des réseaux sociaux de 17%, tandis qu’en Amérique et en Europe, le taux de pénétration d’Internet est de 78% pour l’Amérique et de 86% pour l’Europe, et le taux d’utilisateurs des réseaux sociaux est de 66% (Amérique) et 55% (Europe)

(7)

6

respectivement. Au niveau global, Facebook a été confirmé comme étant le plus grand réseau social du monde, avec plus de 2,4 milliards d’utilisateurs actifs au second trimestre 2019 (Facebook, 2019). Ainsi, l’idée selon laquelle les réseaux sociaux avec leurs interactions sociales répandues et l’engagement soutenu de leurs utilisateurs pourraient servir dans les DBCIs semble prometteuse. Pourtant, à ce jour, peu de revues systématiques de littérature et de méta-analyses ont examiné l’influence des réseaux sociaux sur la modification des comportements de santé (Maher et al., 2014; Williams et al., 2014; Laranjo et al., 2015). Alors que Laranjo et al. (2015) ont trouvé un effet positif des interventions recourant aux réseaux sociaux pour modifier les comportements de santé, Maher et al., (2014) et Williams et al.

(2014) n’ont trouvé que peu de preuves à ce sujet. Ces trois études passaient en revue toute la littérature scientifique sur ce sujet jusqu’à 2015. Depuis, malgré la croissance continue de l’influence d’Internet et des réseaux sociaux décrite auparavant et, malgré l’apparition d’une multitude d’applications de santé (health apps, angl.), de programmes de santé en ligne et d’accessoires techniques portables (wearables, angl.), aucune mise à jour sur ce sujet n’a pu être trouvée. Cette absence de mise à jour, couplée au fait que les coûts de santé ne cessent d’augmenter et que les comportements néfastes soient toujours répandus, a motivé la décision de mener ce travail. Cette revue systématique de littérature et méta-analyse tente ainsi de remédier aux lacunes scientifiques depuis 2015 en passant en revue toutes les preuves scientifiques existantes concernant les interventions visant à modifier les comportements de santé à l’aide des réseaux sociaux.

(8)

7

Illustration 2 mappemonde de la pénétration des réseaux sociaux en 2019 (source : Alexa/SimilarWeb, Vicenzo Cosenza/vincos.it)

(9)

8 Résumé du travail (français)

Modifications des comportements liés à la santé à l’aide des réseaux sociaux : revue systématique de littérature et méta-analyse.

Ce travail a été approuvé et supervisé par l’unité d’épidémiologie populationnelle (UEP) des Hôpitaux Universitaires de Genève qui fait partie de l’Institut Éthique Histoire Humanités.

L’UEP conduit des études interventionnelles dans le domaine des maladies chroniques, des comportements à risques de cancer et des maladies cardiovasculaires avec pour principal objectif la promotion de la santé de la population générale et la construction d’une banque de données populationnelles utile à la recherche. Depuis 1993, l’UEP mène une enquête et une étude de santé appelée « Étude ‘Bus Santé’ ». Dans le cadre de cette étude, des résidents Genevois adultes sont questionnés sur leur santé, leurs activités physiques et leur nutrition, et des mesures anthropométriques ainsi qu’un bilan sanguin sont effectuésⁱ.

L’influence croissante d’Internet et des réseaux sociaux amène à se poser la question de leur impact sur la santé des gens ainsi que leur potentiel dans le domaine de la santé : les réseaux sociaux, préexistants et devenus quasiment omniprésents dans les pays occidentaux, pourraient-ils servir d’intermédiaire efficace entre les institutions soignantes/médicales et leurs patients actuels ? Pourraient-ils jouer un rôle préventif en incitant les gens à adopter des comportements plus sains ? Alors que les interventions dites « standards » courent le risque de ne pas être adoptées de façon efficace et répétée par la population cible du fait qu’elles demandent souvent une première étape consciente et délibérée (ouvrir une brochure de santé pour suivre des recommandations, par exemple, ou mémoriser les conseils donnés par son médecin de famille), cet obstacle serait moindre, voire disparaîtrait, si l’intervention était transmise via un outil auquel l’utilisateur/l’utilisatrice accèderait quoi qu’il en soit. Il a été

ihttps://www.hug-ge.ch/medecine-premier-recours/unite-epidemiologie-populationnelle (dernier accès 27 août 2019)

(10)

9

décidé de procéder à cette revue systématique de littérature et méta-analyse dans le but d’obtenir une vue d’ensemble sur les données scientifiques existantes sur ce sujet. Cette dernière servira de base d’évaluation pour savoir si des interventions de santé à l’aide des réseaux sociaux seront envisageables et si le questionnaire actuel de l’étude Bus Santé sera éventuellement être modifié.

La première étape du travail consistait à parcourir différentes bases de données, des archives et fichiers digitaux ainsi que des journaux médicaux afin de se familiariser avec le sujet, de se procurer une vue d’ensemble et de s’assurer qu’un travail similaire n’ait pas déjà été mené récemment. Un protocole a ensuite été rédigé en suivant les recommandations de PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses)ⁱⁱ, un ensemble de recommandations standardisées pour l’élaboration des revues systématiques de littérature et méta-analyses. Par la suite, ce protocole a été enregistré sous PROSPERO, un fichier digital prospectif et international des revues systématiques de littératureⁱⁱⁱ. L’étape d’après consistait en l’élaboration d’une stratégie de recherche et son application aux bases de données Pubmed, Medbase, Embase et Web of Science. Une stratégie de recherche spécifique pour chaque base de données a été nécessaire au vu des différences informatiques entre ces dernières. L’objectif de la recherche était de trouver des articles qui appliquaient un réseau social dans leur intervention, qui se focalisaient soit sur l’activité physique, soit sur le tabagisme, soit sur la consommation d’alcool ou encore la gestion du poids corporel, qui incluaient comme population cible des adolescents ou/et des adultes et qui étaient des essais randomisés contrôlés (Randomized Controlled Trials, angl.). 3511 articles ont été recueillis. L’étape de sélection a été effectuée par deux investigateurs indépendants par le biais d’une plateforme informatique

ii http://www.prisma-statement.org/ (dernier accès 25 août 2019)

iii https://www.crd.york.ac.uk/prospero/ (dernier accès 25 août 2019)

(11)

10

(Covidence)^iv où les articles ont été sauvegardés avec rejet automatique des duplicatas au préalable. Lors de cette étape, les investigateurs ont suivi strictement le protocole et ses critères d’inclusion et d’exclusion. Lorsqu’un désaccord sur l’inclusion ou l’exclusion d’un article survenait, le groupe d’experts préalablement formé était contacté afin d’arriver à une décision unanime. 20 études ont été incluses dans cette revue systématique de littérature et deux méta- analyses ont été réalisées, groupées selon leur résultat primaire, soit le poids corporel soit l’activité physique. RevMan5.3, un outil informatique en téléchargement libre^v, a été utilisé pour les calculs statistiques et l’élaboration des graphiques. La conclusion du travail ouvre la perspective sur la future recherche scientifique en soulignant l’importance de répondre aux limitations décrites afin d’arriver à des preuves scientifiques à haute validité et généralisables.

Conclusion (français)

Les deux méta-analyses réalisées dans le cadre de cette revue systématique de littérature démontrent des effets modestes mais statistiquement significatifs en faveur des interventions recourant aux réseaux sociaux pour promouvoir des comportements favorables à la santé. Ces résultats attestent ainsi d’un certain potentiel des réseaux sociaux à aider les gens à adopter des comportements favorables à la santé, au moins en ce qui concerne l’activité physique et la gestion du poids corporel. Or, cette conclusion est à considérer avec prudence en raison de limitations inhérentes aux études examinées. Ces dernières ont presque entièrement été menées dans des pays occidentaux, étaient le plus souvent de petite envergure et concernaient le plus souvent une population à prédominance féminine bien éduquée. Ceci rend une transposition à l’ensemble de la population difficile. De plus, la durée d’études était limitée, ce qui ne permet pas de tirer de conclusions sur les implications à long terme. Plusieurs auteurs ont également

iv https://www.covidence.org/home (dernier accès 25 août 2019)

v https://community.cochrane.org/help/tools-and-software/revman-5/revman-5-download/installation (dernier accès 25 août 2019)

(12)

11

exprimé leur doute concernant le concept des essais randomisés contrôlés pour étudier l’efficacité des réseaux sociaux sur la modification des comportements de santé, déplorant leur caractère artificiel et restreint. Afin d’aboutir à une conclusion plus fondée et apte à être transposée à la population générale, les limitations décrites doivent d’abord être solutionnées par les futures recherches. Par ailleurs, il manque à l’état actuel des preuves scientifiques en ce qui concerne l’utilité des réseaux sociaux dans le domaine du tabagisme et de la consommation d’alcool. Pour ce qui est de l’étude Bus Santé, il est proposé d’élargir le questionnaire actuel portant sur l’utilisation d’Internet et des réseaux sociaux. Le recueil de données locales et prospectives pourrait se révéler précieux à l’avenir pour examiner l’influence à long terme d’Internet et des réseaux sociaux sur la santé de leurs utilisateurs/utilisatrices.

Discussion (français)

La présente revue systématique de littérature a démontré des effets modestes mais statistiquement significatifs en faveur des interventions de promotion des comportements favorables à la santé à l’aide des réseaux sociaux. Or, les limitations décrites empêchent une conclusion plus probante.

Le type d’étude utilisé dans les articles inclus dans cette analyse constitue la première problématique. Alors que les essais randomisés contrôlés sont considérés comme étant le gold standard de recherche, ils démontrent tout de même certains désavantages, notamment : leur caractère artificiel, un nombre de participants restreint, des coûts considérables, une durée souvent limitée, etc. Dans ce cas précis, particulièrement, certains critères de qualité sont difficiles voire impossibles à résoudre, notamment le principe en double aveugle : alors que la répartition des groupes pourrait se faire à l’insu des cliniciens, cela n’est pas possible pour les participants. Ces derniers sauront forcément s’ils se trouvent dans le groupe d’intervention

(13)

12

(accès au réseau social choisi pour l’intervention) ou le groupe de contrôle (pas d’accès à ce réseau social).

Une autre difficulté réside dans l’établissement des liens de causalité. Pour modifier un comportement de santé, il est généralement reconnu que l’approche de référence est multidisciplinaire et multidimensionnelle. Il s’avère difficile par la suite de décartelliser les différentes influences pour retrouver le ou les facteurs de réussite. Et la difficulté est accentuée par le fait que la variable indépendante ‒ les réseaux sociaux ‒ est elle-même multidimensionnelle : il existe de nombreux réseaux sociaux différents (Facebook, Twitter, Instagram, WeChat, TikTok, etc.), chacun doté de ses propres caractéristiques et accessibles via divers appareils (ordinateur, laptop, smartphone, smartwatch, etc.), et utilisé et influencé par un cercle de personnes indéterminé et illimité. Cela crée un environnement dynamique plein de (sous-)variables indépendantes et difficile à contrôler. Difficile, ainsi, de savoir si un résultat positif est lié au type de réseau social choisi, en raison des interactions spécifiques qui s’y sont déroulées, ou au type d’appareil utilisé pour s’y connecter.

Au vu des enjeux décrits, la question de savoir si un autre type d’étude serait mieux adapté pour analyser la présente question de recherche se pose. Les études observationnelles de type étude de cohorte et étude cas-témoin ne semblent pas appropriées. Ceci en raison du fait que la première est majoritairement utilisée pour trouver l’incidence d’une certaine maladie et/ou ses facteurs de risque et que la seconde présente des avantages avant tout en cas de maladies rares.

Une étude transversale ne semble pas judicieuse non plus dans la mesure où elle est le plus souvent utilisée pour trouver la prévalence d’une maladie. Des analyses qualitatives, en revanche, pourraient s’avérer utiles. En recueillant des données principalement par le biais d’entretiens, les chercheurs bénéficieraient d’un feedback vaste et direct de la part des participants quant à leur expérience individuelle, leur besoin, leur attitude et leur

(14)

13

comportement. Ce retour qualitatif aurait l’avantage de mieux cerner les différentes dimensions ci-décrites et les facteurs susceptibles d’influencer un certain comportement par rapport à la santé. Ainsi, d’éventuels liens de causalité se fonderaient sur une base plus claire et plus fiable.

Il est évident qu’une telle approche qualitative ne répondra pas aux plus hauts standards de qualité de la recherche scientifique mais sera toutefois d’une grande valeur et constituera une bonne base sur laquelle les autres types d’études menées ultérieurement pourront s’appuyer.

Hormis le débat sur le type d’étude le plus approprié afin de répondre à la présente problématique de recherche, un autre aspect mérite une attention toute particulière. Lorsque l’on souhaite utiliser les réseaux sociaux pour changer des comportements de santé, il est indispensable de prendre en compte au préalable le problème des fake news et de s’intéresser aux moyens de réduire au maximum leur influence. De façon générale, on décrit les fake news comme étant des informations fausses - délibérément ou pas – qui se répandent à travers les médias. Il semble évident que leur fréquence d’apparition et leur vitesse de dissémination ont été augmentées et facilitées avec l’arrivée d’Internet. L’émergence des réseaux sociaux semble avoir accéléré cette tendance, avec des utilisateurs qui sont à la fois des émetteurs et des récepteurs d’informations. Dans le meilleur des cas, les fake news ne présentent que des informations déplaisantes et nuisibles et il est souvent difficile de faire le tri entre les informations vraies et les informations erronées. Parfois, les fake news peuvent avoir des implications plus sérieuses, notamment quand il s’agit de fake health advices (faux conseils de santé). L’exemple récent d’un chef d’état se demandant, face aux médias, si l’ingestion d’un désinfectant ne pourrait pas protéger contre le virus COVID-19^vi serait une parfaite illustration de ce danger. Alors que les professionnels de santé ont tout de suite démenti cette hypothèse, elle s’est tout de même répandue rapidement, notamment sur les réseaux sociaux, augmentant

vi https://www.nytimes.com/2020/04/26/us/politics/trump-disinfectant-coronavirus.html

(15)

14

le risque qu’un cas réel d’ingestion de désinfectant se produise. Si l’on souhaite changer les comportements de santé à l’aide des plateformes sociales digitales, il paraît essentiel de maîtriser au mieux cette variable qu’est la fake news afin d’éviter qu’elle vienne faire obstacle à l’adoption d’un meilleur comportement de santé. Alors que les grands réseaux sociaux essayent de combattre la désinformation et les fake news de façon systématique à l’aide de sociétés de fact checking, de logiciels intelligents et d’informations fournies aux utilisateurs, l’éradication des fake news reste toutefois extrêmement ardue.

Suite à une proposition concrète permettant d’aborder et de réduire en grande partie les limitations et difficultés ci-décrites par rapport au présent travail et à ses différentes études, une analyse qualitative, basée sur le même sujet de recherche que le présent travail, pourrait se faire de la manière suivante : un réseau de médecins de famille, composé de plusieurs cabinets individuels ou de plusieurs cabinets groupés, se forme et tous se mettent d’accord sur l’implémentation d’une étude qualitative et prospective. Les participants à cette étude seront les patients récurrents du réseau nouvellement constitué. L’étude porte sur deux éléments : d’une part, sur un groupe fermé sur un réseau social (exemple : Facebook), d’autre part, sur un lieu de partage en ligne dédié aux professionnels de santé ainsi qu’à l’échange d’informations et le recueil des données (exemple : la plateforme logicielle Microsoft Teams). Les médecins participant se mettent d’accord sur la variable dépendante (ex. : perte de poids) et les variables indépendantes incluses dans le groupe Facebook (ex. : le partage de real news, sur le fil d’actualité du groupe, des dernières découvertes scientifiques et techniques en rapport avec la perte du poids).

Un groupe fermé sur Facebook signifie que la participation se limite à des personnes ayant reçu une autorisation préalable de participation de la part d’un administrateur du groupe. Dans cet exemple, tous les médecins participants auront le rôle d’administrateur du groupe. Ce rôle sera

(16)

15

multiple. D’une part, les administrateurs veilleront à l’élimination des fake-news qui pourraient surgir dans la dynamique de partage entre les participants du groupe. Les administrateurs partageront également des real news comme précédemment mentionné. D’autre part, les administrateurs jouent un rôle dans la promotion de la santé afin de soutenir les participants dans leur perte de poids, par exemple en lançant des petits défis sportifs hebdomadaires avec des rappels récurrents ou en partageant des success stories (avec l’accord des personnes concernées).

Comme mentionné précédemment, la deuxième partie de l’étude concerne la plateforme logicielle Microsoft Teams : elle servira, d’une part, à l’échange d’informations entre les personnels de santé, notamment via la fonction de live chat ou de vidéoconférence de groupe.

Cette plateforme servira également au recueil des données : un tableau Excel pourra être créé et les médecins pourront y enregistrer des données démographiques, l’évolution du poids des participants, les points essentiels qui ressortent des entretiens continus avec les participants, etc.

Par rapport à une étude randomisée contrôlée, une telle approche aurait l’avantage d’être moins coûteuse et permettrait de mieux décartelliser les différentes variables indépendantes en vue des feedbacks directs des participants. Elle pourra être maintenue pour une durée prolongée et le risque d’influence des fake news serait minimisé. Il est clair, à ce stade, qu’il ne s’agit là que d’une proposition de base avec un cadre général. Divers aspects ont encore besoin d’une évaluation plus profonde, notamment pour garantir la sécurité informatique et la protection des données des participants ainsi que la mise en place d’un groupe de contrôle approprié.

Pour revenir à la présente thèse, certains aspects limitent l’impact de ce genre de recherche et empêchent l’auteur de cette revue de parvenir à une solution. Sur un plan plus général, la question de recherche elle-même pourrait être remise en question en vue d’une limitation

(17)

16

d’application de domaine inhérente à cette dernière : même l’approche scientifique la plus appropriée ne parviendra pas à résoudre un problème général. Les personnes ne possédant pas de compte sur un des réseaux sociaux seront exclues des éventuels bénéfices trouvés en rapport avec la promotion des comportements favorables à la santé. Cette problématique de base est générationnelle : alors que la génération des « millennials » y est représentée à quasiment 100%, le nombre d’utilisateurs décroît continuellement au fil du temps, avec la génération X, la génération des baby-boomers et au-delà. Ainsi, les personnes âgées ne sont que rarement représentées sur les réseaux sociaux. Ceci est d’autant plus déplorable qu’il s’agit d’une tranche d’âge vulnérable chez qui la question d’une promotion optimale de la santé est omniprésente.

Cette problématique générationnelle pourrait être décrite, en termes simples, comme suit : ‘ne pas vouloir y participer’ et ‘ne pas savoir comment y participer’. Or, la problématique de base concerne également des aspects sociaux et géographiques, du type ‘vouloir mais ne pas pouvoir’ : pour les classes sociales vivant au-dessous du seuil de pauvreté et/ou dans des régions excentrées, le problème de base est l’absence d’accès à Internet. Ces personnes, considérées comme un groupe social vulnérable, seront de même exclues des éventuels bénéfices trouvés lors des futures recherches sur ces réseaux sociaux. Les résultats du présent travail et des futurs travaux dans ce domaine pourraient donc s’avérer utiles pour certains groupes de population et pour savoir comment y promouvoir la santé, tandis que pour d’autres groupes, la question de recherche ne sera pas applicable et devra être ajustée en conséquence.

(18)

17 1 Abstract

Context. Despite having been the target of global, national and local policies for at least two decades, adverse health behaviours such as physical inactivity, unhealthy dieting, smoking and excessive alcohol consumption are still widespread within populations and some are also on the rise (e.g. excess weight). Thanks to rapid progress in information and communication technologies, digital behaviour change interventions (DBCI) have been developed. With social media sites increasingly becoming part of people’s daily lives, there is a need to understand if, in the context of DBCI, they are effective and efficient in promoting behavioural change when compared to standard interventions.

Objective. To conduct a systematic review of the effectiveness and efficiency of interventions that use social networking sites (SNS) to change people’s health behaviours compared to standard interventions.

Materials and methods. Four databases were searched using pre-specified search strategies.

Studies included were randomized controlled trials (RCT) or RCT-like studies, focused on people 13 years of age or older, involved an SNS, had some sort of comparison, and assessed a primary outcome in the domain of physical activity (PA), diet, smoking, or alcohol consumption. Two independent reviewers conducted the study selection process. The Revised Cochrane risk-of-bias tool for randomized trials was used for quality assessment.

Results. 3511 potential studies were retrieved through a database search, of which 16 RCT and 4 RCT-like studies (4541 participants in total) were included in this review. Meta-analysis showed a modest, though significant, positive effect of SNS on PA with a summary effect of 0.17 standardized mean difference (SMD) (95%CI 0.04 to 0.31). Heterogeneity was high with an I² of 51%. We also showed a modest, though significant, positive effect of SNS on body weight (BW) with a summary effect of 0.31 SMD (95% CI 0.01 to 0.60, I² = 70%).

Conclusion. Our results show that SNS may be effective in helping people to adopt healthier behaviours, at least for PA and BW. The highly controlled and artificial nature of RCT may have induced an underestimation of the effects.

Protocol registration. This review was registered on PROSPERO (registration-ID 138872)

(19)

18 2 Introduction

Physical inactivity, tobacco use, unhealthy diet, and the harmful use of alcohol have been defined by the WHO as common, preventable risk factors that underlie most non- communicable diseases, eventually leading to key metabolic/physiological changes such as raised blood pressure, obesity, raised blood glucose and cholesterol-levels, and premature mortality (World Health Organization, 2019). Despite having been the target of global, national and local policies for at least two decades, these risk behaviours are still widespread within the population and some also are on the rise. The health benefits of exercise and PA are well established. However, in most western countries a large proportion of the population does not attain the weekly minimum quota of 150 minutes of moderate intensity sport or daily life activities or the minimum of 75 minutes of PA at high intensity recommended by the World Health Organisation WHO (World Health Organization WHO, 2015). For example, in Switzerland, despite extensive health promotion attempts by the Federal Office of Public Healthand the Federal Office of Sport, latest statistics show that 25% of the adult population is sedentary and 42.7% is overweight or obese, compared to 31.2% in 1992 (Pahud and Zufferey, 2019). Today, globally, one in four adults is not sufficiently active (World Health Organization WHO, 2018a) and worldwide obesity has nearly tripled since 1975 with more than 1.9 billion adults overweight – and 650 million of them obese - in 2016 (World Health Organization WHO, 2018b). Tobacco has been the target of public health policies for at least half a century (Lushniak et al., 2014). Although the latest global report shows a global smoking rate reduction of 22%, this is still short of the voluntary global target established by the WHO of a 30% reduction between 2010 and 2025 (World Health Organization WHO, 2018d).

Tobacco still kills more than 8 million people each year, either as a result of direct tobacco use or through exposure to second-hand smoking. On the same page, it is estimated that annually 3 million deaths worldwide result from the harmful use of alcohol, representing 5.3% of all

(20)

19

deaths, a number that leaps to 13.5% for the age group 20-39 years (World Health Organization WHO, 2018b, 2019).

The still widespread prevalence of adverse health behaviours generates important costs related to the treatment of related non-communicable diseases and the consequent loss of human capital (World Health Organization, 2019). Total health spending is growing faster than gross domestic product, increasing more rapidly in low- and middle-income countries (close to 6%

on average) than in high income countries (4%) (Xu et al., 2018). In order to strengthen national efforts to address the burden of non-communicable diseases, the WHO has framed a global action plan for the prevention and control of several key risk factors, including the harmful health-related behaviours described above, with a time horizon up to 2025 (World Health Organization WHO, 2013). On the national level, the WHO calls on its member countries to formulate, implement, monitor and evaluate appropriate public policies. It calls for political commitment from governments to facilitate health behaviour promotion on the political agenda with subsequent national policies and action plans, giving the example of a national PA plan with specific goals and a strategic plan for action (World Health Organization WHO, 2014).

Although structural and nationally-led policies are certainly necessary to reach the targets set by WHO, technological advances may also contribute to opening the way to new prevention tools at the individual level. The field of behaviour change interventions (BCI) has recently been enriched by a sub-area called digital behaviour change intervention (DBCI) emerging from today’s rapid progress in information and communication technologies. DBCI is defined by Yardley et al. (2016) as an intervention that employs digital technology to promote and maintain health through primary or secondary prevention. DBCI opens up new opportunities to influence people’s lives and daily habits, especially in light of the internet’s steady rise in influence and its connected use of social networking sites. According to recent

(21)

20

estimates (HOOTSUITE et al., 2019),58% of the worldwide population is connected to the internet and 45% of the total worldwide population are active social media users, although there are important regional differences (in Africa the internet penetration is at 36% and the active social media users’ rate at 17%, versus 78% and 86% for America and Europe for internet penetration and 66% and 55% for social media use, respectively). On a global level, Facebook has been confirmed as the biggest social network worldwide with 2.41 billion monthly active users as of the second quarter of 2019 (Facebook, 2019).

There are few systematic reviews and meta-analyses that have assessed the influence of social networking sites (SNS) on health behaviour change (a clear definition of our understanding of an SNS is given in the methods section). Whereas Laranjo et al. (2015) found a positive effect of SNS interventions on health behaviour change, Maher et al. (2014) and Williams et al.

(2014) concluded on very modest evidence in that matter. These reviews summarized evidence up to 2015. However, despite the continued growth and influence of SNS on people’s daily lives described above, and the emergence of a myriad of health-related applications, wearables, and online programmes for behaviour change in recent years, scant scientific update has been found on this subject. At a time when health costs are rising inexorably while adverse health behaviours are still widespread and the potential of new digital technologies to address these problems is fragmentary, we think it is essential to look for new evidence on this matter. This review attempts to fill this gap by updating the existing literature on SNS interventions for health behaviour change through a systematic review and meta-analysis.

3 Methods

For this review, we followed the PRISMA-Equity guidelines (Moher et al., 2009). After the stage of preliminary searches and the wording of the research question, we registered our

(22)

21

review on PROSPERO, an international prospective register of systematic reviews (National Institute for Health Research NHS, 2019), checking simultaneously for similar ongoing reviews. The registration-ID of the present record is 138872, accessible through the official Prospero website (https://www.crd.york.ac.uk/prospero/, last access 14 october 2019 - as at 14 October 2019, it is still being assessed, and is thus not yet accessible for everybody).

The database search was performed in June 2019 by searching Pubmed/Medbase, Embase and Web of Science (also known as Web of Knowledge). Since in-depth systematic reviews were published in 2014 (Maher et al., 2014; Williams et al., 2014) and 2015 (Laranjo et al., 2015) we searched for articles published after 2014, written in English, German, and French. If the full text was not available or data were missing, the authors were contacted twice. If unsuccessful, efforts were abandoned and noted accordingly. Details of the electronic search strategy applied to Pubmed/Medbase are available as a supplement (c.f. Appendix1).

As mentioned above, our study population consisted of adolescents and adults, i.e. people beyond twelve years of age, regardless of their health status. Eligible studies needed to involve an SNS – defined in the next section - either stand-alone or as part of a bigger, multi-channelled intervention. We kept the definition of SNS broad in order to not miss important studies. Thus, we not only searched for social media sites such as Facebook, Twitter, Instagram, WhatsApp, and YouTube in western countries, but also for blogs, health-specific SNS, and SNS used elsewhere outside Europe, including WeChat, QQ, QZone, and VKontakte. Simple, not specified mobile health apps (also known as mHealth, eHealth), web-based programmes without the possibility of interaction through a digital platform, and real life and offline social networks were excluded as these did not match our research question. Any kind of comparison was accepted, for example the comparison with another SNS intervention group, with an

(23)

22

offline social network, traditional interventions (web-based, paper-based, telephone, TV, radio, billboard advertising, personal contact), or a within subject-comparison in the case of pre-post study designs. The assessed outcome was health behaviour change restricted to the domains of PA, diet/weight management, smoking, and alcohol consumption (Ashton et al., 2017). Other common outcomes in relation to SNS such as recruiting, advertising and education were excluded. In this review, we focused on Randomized Controlled Trials (RCT) as our preferred study design in order to maximize the level of evidence, setting a minimum number of ten eligible RCTs as a benchmark. Had this not been reached, a second database search with a subsequent second screening process would have been conducted in order to also include other types of studies, such as quasi-experimental designs and before-and-after studies.

3.1 Definition of social networking sites (SNS)/social media and Web 2.0

In this review we used the definition of social media and the closely related term of Web 2.0 given by Kaplan et al. (2010). According to them, Web 2.0 describes a new way of using the World Wide Web: content on the internet is no longer created and published by individuals, but instead is continuously modified by all users in a participatory and collaborative fashion.

Social media is then looked at as a group of internet-based applications that build on the ideological and technological foundations of Web 2.0 allowing the creation and exchange of user generated content.

3.2 Study selection and data extraction

The entirety of references obtained through a database search was imported into Endnote, a software tool for publishing and managing bibliographies, citations and references (www.endnote.com, last access 14 October 2019). Endnote represented our backup online platform where a first exclusion of duplicates took place. The remaining references were then

(24)

23

imported into Covidence, a screening and data extraction tool created in collaboration with the Cochrane organisation (www.covidence.org, last access 14 October 2019). The subsequent study selection process was performed by two independent reviewers (AMR, CDM) in two stages: an initial screening of titles yielding potentially relevant papers was followed by a full text reading. In order for an RCT to be included, the review team had to agree. If disagreement occurred, an expert team (SS, DP) was consulted and the final decision was taken together. A flowchart gives an overview of the number of studies that remained at each selection stage including the reasons for exclusion (Figure 1).

Data extraction was performed by the main author (AMR) and proofread by the second reviewer (CDM). For each study, the following data were extracted: first author, title, study region/country, target population and their inclusion/exclusion criteria, number of participants, characteristics of intervention group (IG) and SNS used, characteristics of control group (CG), primary outcome, and study duration. The extracted data were arranged in an excel sheet to give readers a concise yet complete overview of the studies included (c.f. Table 1). Following the reasoning and practice of Williams et al. (2014), when there were more than two study arms, we extracted data from the arm with the least intervention as the control group (e.g.

waitlist group, control group, group without social media component, etc). Likewise, when there were multiple intervention groups involving social media, we extracted data for the group with the fewest co-interventions in an attempt to examine the specific impact of social media.

Subsequently, the results for primary outcomes along with statistical evidence (mean, standard deviation/standard error, 95% confidence interval, etc.) and the author’s conclusions and the limitation of the included studies were extracted and summarized in a second excel sheet (c.f.

Table 2).

(25)

24 3.3 Quality assessment

We applied the Revised Cochrane risk-of-bias tool for randomized trials RoB2 (Sterne et al., 2019) to assess the risk of bias in the findings of included randomized controlled trials, summarized in Table 3. The tool builds up on five domains through which bias might be introduced into the result: (1) bias arising from the randomization process, (2) bias due to deviations from intended interventions, (3) bias due to missing outcome data, (4) bias in measurement of the outcome, and (5) bias in selection of the reported result. Each domain had signalling questions that needed to be answered with ‘yes’, ‘probably yes’, ‘no’, ‘probably no’, or ‘no information’. For each domain, an algorithm was provided, which had to be followed according to the given answers, leading to one of three risk categories of bias: (A) low risk of bias, (B) some concerns, and (C) high risk of bias. The default overall judgement for the RCT examined became high risk of bias if at least one of the 5 domains was judged as being at high risk of bias. In that case we still continued and completed the assessment of the study, following the reasoning of Sterne et al. (2016), so that complete evaluation might be available which might serve in the design and conduct of future studies.

3.4 Data synthesis

We decided on a hybrid approach by conducting a meta-analysis when appropriate and applying a narrative synthesis otherwise. Whenever possible, intention-to-treat results were used for statistical analysis. To give readers a graphic overview of the results of the included studies, the continuous primary outcomes were summarized in Figure 2. Two meta-analyses were conducted for continuous outcomes using the statistical tool of Review Manager 5 (RevMan5.3), a software application used for preparing and maintaining Cochrane Reviews (The Cochrane Collaboration, 2019). We pooled the studies with PA as a primary outcome, and conducted a first meta-analysis with effect size calculated as the principal summary

(26)

25

measurement by using standard mean difference and Mantel-Haenszel methods for combining results across studies. Heterogeneity was assessed using the I² statistic. Since there was substantial heterogeneity in intervention effects, we used a random-effects analysis model where the true effects in the studies are assumed to vary between studies with the summary effect being the weighted average of the effects reported in the different studies (Borenstein et al., 2009). Subgroup analysis was performed to see if heterogeneity could be minimized considerably by removing certain studies. Likewise, a second meta-analysis was conducted by pooling studies with BW as their primary outcome. To better understand the unit-less effect size of the meta-analysis, we re-expressed the latter by applying the calculated SMD back into one of the original studies and depicted it on the scale used in that study following the instructions of Higgins and Green (2011). This was done by multiplying the standard deviation of the control group by the pooled SMD (i.e. effect size), with the resulting number being the mean difference for the scale used in the chosen representative study. RevMan5.3 was also used to execute the transformation between standard deviation (SD), standard error (SE) and the 95% confidence interval (CI), since the authors of included studies presented their results differently. In order to pool differently assessed outcomes (Figure 2), we standardized results to a uniform scale following the advice of Higgins and Green (2011). For example, body weight outcomes were multiplied by -1 to match the direction of physical activity outcomes.

The possibility of publication bias was assessed by conducting a funnel plot of the standard mean differences and evaluating the latter for asymmetry. Although visual evaluation might be prone to subjectivity, with different observers interpreting funnel plots differently, an Egger’s test was not performed. According to Borenstein et al. (2005) the power of this test is low unless there is severe bias or a substantial number of studies - which was not the case in this review. For the second meta-analysis, neither funnel-plot analysis nor the Egger-test was

(27)

26

performed, following the recommendation of Higgins and Green (2011) not to do so for fewer than ten included studies, since the power of the test would become too low to distinguish chance from real asymmetry.

We chose a narrative approach to present studies where data were missing or could not be integrated into the meta-analysis (e.g., dichotomous outcome).

4 Results

4.1 Study selection

Figure 1 shows the flow diagram of study selection. The database search yielded 3511 studies of which 139 were inspected in their entirety for potential inclusion. Twenty studies met our inclusion criteria, including one identified through the scanning of reference and citation lists of relevant papers.

4.2 Study characteristics

Table 1 presents the included studies and their characteristics. The studies were published between 2015 and 2019 and involved a total of 4,541 participants, the majority of whom were young adults of predominantly white non-Hispanic origin, with an above-average level of education (tertiary education). Most studies included a predominantly female population. Only two studies showed a balanced sex distribution (Irwin et al., 2016; Krishnamohan et al., 2017) and two had an exclusively (Ashton et al., 2017) or predominantly (Cheung et al., 2015) male sample.

(28)

27

Sixteen studies (Maher et al., 2015; Rote et al., 2015; Ruotsalainen et al., 2015; Schneider et al., 2015; Zhang et al., 2015, 2016; Godino et al., 2016; Hales et al., 2016; Irwin et al., 2016;

Rovniak et al., 2016; Balk-Møller, Poulsen and Larsen, 2017; Jane et al., 2017; Kolt et al., 2017; Looyestyn et al., 2018; Mascarenhas et al., 2018; Kernot et al., 2019) were RCTs, the remaining four being a cluster RCT (Cheung et al., 2015), a pilot RCT (Ashton et al., 2017), a randomized ecological trial (Vandelanotte et al., 2017), and a non-RCT (Krishnamohan et al., 2017).

The study duration ranged from a one-day intervention (Irwin et al., 2016) to a two-year intervention (Godino et al., 2016) with most studies lasting between two and six months. Nine studies were conducted in the US (Rote et al., 2015; Schneider et al., 2015; Zhang et al., 2015, 2016; Godino et al., 2016; Hales et al., 2016; Irwin et al., 2016; Rovniak et al., 2016;

Mascarenhas et al., 2018) and seven in Australia (Maher et al., 2014, 2015; Ashton et al., 2015;

Jane et al., 2017; Kolt et al., 2017; Looyestyn et al., 2018; Kernot et al., 2019). The remaining studies were conducted in Finland (Ruotsalainen et al., 2015), Denmark (Balk-Møller, Poulsen and Larsen, 2017), India (Krishnamohan et al., 2017), and Hong Kong, China (Cheung et al., 2015). The targeted health domains generally included PA (Maher et al., 2015; Rote et al., 2015; Schneider et al., 2015; Zhang et al., 2015, 2016; Irwin et al., 2016; Rovniak et al., 2016;

Ashton et al., 2017; Kolt et al., 2017; Krishnamohan et al., 2017; Vandelanotte et al., 2017;

Looyestyn et al., 2018; Mascarenhas et al., 2018; Kernot et al., 2019), along with BW (Ruotsalainen et al., 2015; Godino et al., 2016; Hales et al., 2016; Balk-Møller, Poulsen and Larsen, 2017; Jane et al., 2017) and smoking (Cheung et al., 2015).

4.3 Intervention and adherence/retention rates

Eleven studies used Facebook as their SNS, either as a stand-alone instrument(Cheung et al., 2015; Maher et al., 2015; Rote et al., 2015; Ruotsalainen et al., 2015; Jane et al., 2017;

(29)

28

Krishnamohan et al., 2017; Looyestyn et al., 2018) or as part of a more complex, multi- component intervention (Godino et al., 2016; Rovniak et al., 2016; Ashton et al., 2017; Kernot et al., 2019). Where Facebook was used as a stand-alone intervention, it was most often guided by a person from the study intervention staff who acted as a group administrator (Rote et al., 2015; Ruotsalainen et al., 2015; Jane et al., 2017; Krishnamohan et al., 2017; Looyestyn et al., 2018)with the aim of facilitating online group interaction, motivation, information, control, and feedback. Eight studies designed their own, health-specific SNS (Schneider et al., 2015;

Zhang et al., 2015, 2016; Hales et al., 2016; Irwin et al., 2016; Balk-Møller, Poulsen and Larsen, 2017; Kolt et al., 2017; Vandelanotte et al., 2017)which were laid out at the same time on social support and on social comparison, including diverse incentive structures such as:

team-building stimuli (Maher et al., 2015; Schneider et al., 2015; Hales et al., 2016; Irwin et al., 2016), periodic feedback on individual and/or team progress (Zhang et al., 2015, 2016;

Rovniak et al., 2016; Balk-Møller, Poulsen and Larsen, 2017; Looyestyn et al., 2018; Kernot et al., 2019), goals page (Rote et al., 2015; Hales et al., 2016), and system of digital rewards (Maher et al., 2015; Balk-Møller, Poulsen and Larsen, 2017; Kernot et al., 2019) or real rewards (Schneider et al., 2015; Hales et al., 2016). The remaining SNS utilized were WhatsApp (Cheung et al., 2015) and Google Hangout (Mascarenhas et al., 2018), the latter having the option of videoconferencing enabling simultaneous exercise for participants.

When the SNS was part of a multi-component intervention, the other elements applied comprised daily or weekly SMS and/or email services (Maher et al., 2015; Schneider et al., 2015; Rovniak et al., 2016; Kolt et al., 2017), web-based health information (Zhang et al., 2015, 2016; Godino et al., 2016), audio-visual messages (Krishnamohan et al., 2017), digital advice from physiotherapists and dietitians (Ruotsalainen et al., 2015; Krishnamohan et al., 2017), print-based health information (Cheung et al., 2015), paper logs for self-recording (Rote et al., 2015), in-person health coaching sessions (Godino et al., 2016), offline group meet-ups

(30)

29

(Schneider et al., 2015; Rovniak et al., 2016), and exercise classes (Zhang et al., 2015, 2016;

Looyestyn et al., 2018).

There was a large variability across the studies in terms of the retention rate of participants.

For example, Irwin et al. (2016)reported a 100% retention rate whereas Vandelanotte et al.

(2017)reported a retention rate below 6%. Further information about retention rates is given in Table 2.

4.4 Quality assessment

Table 2 gives a detailed summary of the principal findings responsible for the resultant category of risk (low risk/some concerns/high risk) applied to the five domains examined (randomization process, deviations from intended interventions, missing outcome data, measurement of the outcome, selection of the reported results). For the sake of completeness, the four included studies previously described as not being full RCTs (a cluster RCT (Cheung et al., 2015), a pilot RCT (Ashton et al., 2017), a non-RCT (Krishnamohan et al., 2017), and a randomized ecological trial (Vandelanotte et al., 2017)) were also assessed, ending up foreseeably with a judgement of a high risk of bias. Of the 16 RCTs, only one was judged to be at low risk of bias (Kernot et al., 2019). Seven studies were judged as having some concerns (Rote et al., 2015; Zhang et al., 2015, 2016; Godino et al., 2016; Hales et al., 2016; Rovniak et al., 2016; Kolt et al., 2017), with none of them specifying a clear allocation concealment.

The remaining eight studies were judged to have a high risk of bias for various reasons:

completers-only analysisinstead of intention-to-treat analysis (Sterne et al., 2011; Ruotsalainen et al., 2015; Balk-Møller, Poulsen and Larsen, 2017; Jane et al., 2017; Mascarenhas et al., 2018); self-reported outcomes instead of objectively measured outcomes (Maher et al., 2015;

Krishnamohan et al., 2017; Looyestyn et al., 2018; Mascarenhas et al., 2018); group

(31)

30

differences in baseline assessment (Schneider et al., 2015; Balk-Møller, Poulsen and Larsen, 2017); and faulty randomization process (Hales et al., 2016).

4.5 Comparison and outcomes

Six studies had passive control groups; of which two had no intervention at all (Ruotsalainen et al., 2015; Balk-Møller, Poulsen and Larsen, 2017), and four had a waitlist control group for comparison (Maher et al., 2015; Ashton et al., 2017; Mascarenhas et al., 2018; Kernot et al., 2019). The remaining fourteen studies showed active control groups with diverse designs:

whereas Hales et al. (2016) chose a standard, commercially available app for comparison, Rote et al. (2015)set up a “standard walking intervention” to be followed by the control group. Six studies (Zhang et al., 2015, 2016; Godino et al., 2016; Irwin et al., 2016; Krishnamohan et al., 2017; Vandelanotte et al., 2017) chose to give the control group access to basic web 1.0 features, whereas other six chose to deliver health information, guidelines, and recommendations via email and paper-based media to the comparison group (Cheung et al., 2015; Schneider et al., 2015; Rovniak et al., 2016; Jane et al., 2017; Kolt et al., 2017;

Looyestyn et al., 2018). All included studies had at least one comparison group with no SNS features. Two studies had multiple SNS in their intervention: whereas Cheung et al. (2015) chose to set up two separate SNS intervention groups - Facebook and WhatsApp - Rovniak et al. (2016)decided to expose the three town sites of their study to different SNS (2x Facebook, 1x health-specific SNS) but merged them into one single SNS intervention group for analysis.

The vast majority of studies had PA as their primary outcome, broken down into: steps/day (Rote et al., 2015; Ashton et al., 2017; Kernot et al., 2019), moderate to vigorous PA (MVPA) (Maher et al., 2015; Rovniak et al., 2016; Kolt et al., 2017; Looyestyn et al., 2018;

Mascarenhas et al., 2018), unspecified minutes/week or minutes/day (Krishnamohan et al.,

(32)

31

2017; Vandelanotte et al., 2017), and exercise class attendance rates (Zhang et al., 2015, 2016).

BW was the primary outcome for four studies (Ruotsalainen et al., 2015; Godino et al., 2016;

Hales et al., 2016; Jane et al., 2017), and one focused on smoking relapse rate as its primary outcome (Cheung et al., 2015).

The majority of studies reported an objective assessment of their primary outcome (Rote et al., 2015; Ruotsalainen et al., 2015; Schneider et al., 2015; Zhang et al., 2015, 2016; Godino et al., 2016; Irwin et al., 2016; Rovniak et al., 2016; Balk-Møller, Poulsen and Larsen, 2017; Jane et al., 2017; Kolt et al., 2017; Kernot et al., 2019) compared to eight studies where the outcome was either self-reported (Cheung et al., 2015; Maher et al., 2015; Ashton et al., 2017;

Krishnamohan et al., 2017; Vandelanotte et al., 2017; Looyestyn et al., 2018; Mascarenhas et al., 2018) or not specified (Hales et al., 2016).

4.6 Effectiveness of social media

Although many studies were judged as having a high risk of bias, none were excluded based on their quality assessment. Instead, it was decided to clearly label the risk of bias of the individual studies in the following paragraph and the corresponding figures, allowing the readers to quickly assess and recall the quality assessment described earlier. Consequently, studies with some concern are labelled with one asterisk (*) whereas studies with a high risk of bias are labelled with two (**). Studies not showing any asterisk were judged to be at a low risk of bias.

Table 3 shows the results for primary outcomes with its author’s conclusions and its stated limitations. Thirteen studies found no effect of the SNS on the intervention group compared with the control group(Maher et al., 2015**; Ruotsalainen et al., 2015**; Schneider et al., 2015**; Zhang et al., 2015*, 2016*; Godino et al., 2016*; Rovniak et al., 2016*; Ashton et

(33)

32

al., 2017**; Kolt et al., 2017*; Krishnamohan et al., 2017**; Looyestyn et al., 2018**;

Mascarenhas et al., 2018a**; Kernot et al., 2019). Six studies reported a positive effect of SNS (Rote et al., 2015*; Hales et al., 2016*; Irwin et al., 2016**; Balk-Møller, Poulsen and Larsen, 2017**; Jane et al., 2017**; Vandelanotte et al., 2017**). One study that compared two different SNS study arms to a control group, detected no effect in the Facebook arm and a positive effect in the WhatsApp arm (Cheung et al., 2015**).

Figure 2 provides a general overview of the effect sizes of primary, continuous outcomes with the median effect size being 0.23 in favour of the intervention group (Zhang et al., 2015*).

Effect sizes ranged from -0.29 SMD in favour of the control group (Krishnamohan et al., 2017**) to 0.83 SMD in favour of the intervention group (Jane et al., 2017**). There were four studies with significant differences for primary outcomes favouring the intervention group (Hales et al., 2016**; Irwin et al., 2016**; Jane et al., 2017**; Rote et al., 2015*). Due to its dichotomous primary outcome, the study of Cheung et al. (2015**) was not included in this first figure. Likewise, the studies of Rovniak et al. (2016) * and Schneider et al. (2015)** were not included due to missing data and the non-response of the authors.

4.7 Meta-analysis

Twelve studies (2775 participants) with PA as their primary outcome were pooled in a meta- analysis shown in Figure 3. A modest, though significant positive effect of SNS on PA was found with a summary effect of 0.17 SMD (95%CI 0.04 to 0.31). Despite the large statistical heterogeneity (Tau²= 0.02, Chi²= 22.27, df = 11 [P=0.02], I² = 51%), the random effects meta- analysis showed a statistically significant beneficial effect. A subsequent subgroup analysis showed a drop of heterogeneity parameters when the studies of Irwin et al. (2016) and Rote et al. (2015) were excluded (Tau2=0.00, Chi2=11.11, df=9 [P=0.27],I2=19%), with the summary

(34)

33

effect decreasing to 0.10 SMD (95% CI 0.00 to 0.20). The 0.17 SMD translates to an improvement of 563.75 steps/day in the intervention group relative to the control group.

Five studies (465 participants) having BW as the primary outcome were pooled in a second meta-analysis shown in Figure 4. A modest, though significant, positive effect of SNS on BW was found with a summary effect of 0.31 (95% CI 0.01 to 0.60) SMD. Likewise, despite the large heterogeneity (Tau²= 0.06, Chi²= 13.21, df = 4 [P=0.01], I²= 70%), the random effects meta-analysis shows a statistically significant beneficial effect. The 0.31 SMD translates to a 0.95kg mean weight loss in the intervention group relative to the control group.

4.8 Publication bias

The funnel plot revealed no indication of publication bias (Figure 5). Egger’s test was not performed, and publication bias analysis was not applied to the second meta-analysis on the effect of body weight because fewer than ten studies were included (Higgins and Green, 2011).

4.9 Narrative approach regarding studies not included in a meta-analysis

Rovniak et al. (2016)* and Schneider et al. (2015)** could not be included in our statistical analysis due to missing outcome values and the non-response of the main authors contacted for that matter. Rovniak et al. (2016)* and Schneider et al. (2015)** both assessed PA as their primary outcome over a period of 12 weeks and 6 months respectively. Both studies concluded that all groups improved their PA from the baseline with no significant between-group differences, making the control-design just as effective as the intervention. Both studies reported homogeneity of their study sample as their primary limitation (female, white, non- Hispanic and with above average education).

(35)

34

As mentioned earlier, the study of Cheung et al. (2015)** was listed separately due to its dichotomous outcome. The authors assessed Smoking Relapse Rate (SRR) as their primary outcome over a period of six months, comparing two SNS intervention groups (IG) to a standard control group (CG) designed to receive only a self-help booklet for smoking cessation maintenance. The WhatsApp IG significantly reduced its smoking relapse rate, probably because of enhanced discussion and social support. However, the effect dissipated after the social groups closed. The Facebook IG showed an insignificantly lower relapse rate compared to the CG.

5 Discussion

The present comprehensive systematic review updated the scientific evidence on the efficacy of social networking sites as possible intervention instruments for health behaviour change.

Results with a significant, though modest, summary effect in favour of SNS-based interventions to promote increased PA and balanced BW were found.

5.1 Advantages / drawbacks of Social Networking Sites for health behaviour change

In some respects, SNS seem to clearly outclass standard intervention while showing drawbacks in others. Some of the major advantages of SNS undoubtedly lie in their ability to be accessible at any time, to bridge geographical boundaries and thus open up the possibility of reaching minorities or difficult-to-reach groups, to connect users with likeminded individuals when offline support is not available, and in their capacity to manage large caseloads in a time- effective manner (Jane et al., 2017). It has been suggested that their potential could be enhanced if ready-made platforms (e.g. Facebook) were used, which would reduce costs further and provide health professionals with access to existing social networks (Jane et al., 2017).