Classwork 8

Databases - Social Media Analytics

Author

Byeong-Hak Choe

Published

October 20, 2025

Modified

October 20, 2025


Social Media Survey

๐ŸŽฏ Purpose of this Activity

Extract data from you (students)

By submitting the survey, you generate raw rows in a database table โ€” just like how platforms quietly log every interaction you make.

  • Behind the scenes, each of your clicks becomes a database entry โ€” a row born from your activity.
  • The survey is anonymous โ€” we analyze patterns, not individuals.

Social Media Analytics

ETL Workflow

  • Your role: You submit the survey โ€” that generates raw data rows, just like a platform logging user behavior.

  • My role: I act as the data system. As we walk through the results, Iโ€™ll point out where I use
    filter(), select(), and left_join() to Transform your responses and Load a clean analysis table we can analyze.

  • Big idea: Your responses become a relational dataset, and then we join in algorithm rules โ€” the same way TikTok, Instagram, or Reddit enrich user logs before ranking your feed.

Feed Algorithm

โ€œDo you think your feed reflects your choices โ€” or the platformโ€™s choices about you?โ€

  • Weโ€™ll look at platform behavior patterns using your data:
    minutes, posting activity, device type, and interaction style.
  • Then weโ€™ll run an algorithm simulation using a random card draw system to show how the same minutes can be valued very differently depending on the platformโ€™s ranking logic.

๐Ÿง  Feed Algorithm Interpretation Using Card Mechanics

To illustrate how platforms differ in engagement efficiency, we treat each minute of screen time like drawing a card:

  • Rank (2โ€“10, J, Q, K, A) โ†’ how strong your engagement signal was
  • Suit (platform personality) โ†’ how that platform interprets and amplifies that engagement

Youโ€™ll see that not all platforms reward the same behavior equally โ€” some love viral spikes, some love steady engagement.

๐ŸŽญ Platform Algorithm Personalities โ€” Rank Bias Model

Each platform doesnโ€™t just apply a fixed multiplier โ€” it biases specific rank categories (Face, Ace, Mid, etc.).
This models how TikTok boosts viral spikes, Reddit rewards slower thread engagement, and YouTube locks users into deep watch sessions.

Platform Algorithm Personality Rank Preference Pattern
TikTok โ™  Strategic viral retention โ€” aggressively hunts viral spikes Face โ†’ Ace โ†’ 10 (very spike-oriented, punishes low/no engagement)
Instagram / Threads โ™ฆ Aesthetic/status curation โ€” polished trend amplification 10 โ†’ Face โ†’ Mid (reward polished content, suppress low-value scroll)
YouTube โ™  Strategic deep retention โ€” optimizes long session depth Ace โ†’ 10 โ†’ Mid (not chaotic, but strong on intentional consumption)
Reddit / Discord โ™ฃ Community depth โ€” favors threads, steady engagement over spikes Mid โ†’ Ace โ†’ 10, downweights Face spikes (less about virality)
Snapchat โ™ฅ Emotional burst algorithm โ€” high volatility, short attention loops Face spikes allowed, but low ranks common (chaotic scroll-emotion mix)
X/Twitter โ™ฃ Fast discourse loop โ€” rewards mid-level interaction & repost energy Mid + Face modest bias, suppresses dead scrolling
Facebook โ™ฆ Legacy ranking โ€” mild engagement shaping, mostly neutral Slight mid bias, nearly flat weighting otherwise
None Neutral baseline โ€” no algorithm shaping All rank_adj = 1.00 (control group)

๐ŸŽฎ Rank Interpretation (Base Weights Before Platform Influence)

Rank Category Base Weight Meaning in Attention Model
Joker (0) 1.00 Passive drift โ€” counted time, no engagement signal
2โ€“4 (Low ranks) 1.00 Glance-scroll, content seen without response
5โ€“9 (Mid ranks) 1.05 Light scrolling or minor interaction โ€” retention signal
10 1.10 High interest moment โ€” actively watching or reading
Ace (A) 1.25 Intentional engagement โ€” searching, saving, deep dwell
Face Cards (J/Q/K) 1.50 Emotional/viral spike โ€” strong reaction triggers

๐Ÿšฉ Raw minutes = counting cards without considering rank or suit.
๐ŸŽฏ Algorithm-adjusted attention = calculating your true score based on both rank (quality of engagement) and suit (type of platform algorithm).

In other words:

  • A 20-minute session on Instagram Reels (โ™ฆ Ace) scores high adjusted attention.

  • A 40-minute YouTube autoplay binge (โ™  Joker) has high raw minutes but almost no algorithm-weighted value.

  • Two users can spend the same time, but their card draws (attention efficiency) differ by platform.

    • TikTok may reward spike engagement, while Reddit rewards steady browsing.
    • YouTube may ignore your first 10 minutes, but heavily weight the moment you intentionally engage.

โœ… Key Takeaway

Algorithms donโ€™t just count your time โ€” they interpret and score it.
Some platforms are time sinks, while others are attention amplifiers โ€” even if the minutes look the same.

Questions

  1. If platforms score and rank your behavior in secret, should users be allowed to know their score โ€” or is it okay for the system to stay hidden?

  2. If a platform is designed to keep you scrolling even when the content is low-value, is it ethical to design for addiction instead of meaningful use?



Discussion

Welcome to our Classwork 8 Discussion Board! ๐Ÿ‘‹

This space is designed for you to engage with your classmates about the material covered in Classwork 8.

Whether you are looking to delve deeper into the content, share insights, or have questions about the content, this is the perfect place for you.

If you have any specific questions for Byeong-Hak (@bcdanl) or peer classmate (@GitHub-Username) regarding the Classwork 8 materials or need clarification on any points, donโ€™t hesitate to ask here.

Letโ€™s collaborate and learn from each other!

Back to top