Unlock the Potential of Social Media Data

Srikanth Katasani

Principal Consultant, Data Analytics

Hemanth Kumar Gandham

Associate Consultant

Client: BioPharma
Year: 2018

Social media and patient support forums are rich sources of patient feedback and experiences. Apart from seeking guidance on their post-diagnosis journeys, many patients describe their experiences with existing therapies and their expectations of upcoming treatments.

Pharma companies can use this information to plan their:

  1. Targeting activities, based on an understanding of target patient profiles
  2. R&D activities, keeping in mind the unmet needs expressed by patients

The complications:

  1. Social media data is sensitive to process, given the risk of uncovering adverse events

    So, it is of utmost importance to set the right compliance standards for timely reporting of adverse events

  2. It’s easy to get lost in the vastness of unstructured information available on the internet

    This is where big consultancies and niche product firms fail, as they lack either the technical depth or the domain expertise needed to produce a comprehensive solution
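
As an illustration of the first complication, a compliance workflow could begin with a simple screen that routes posts mentioning possible adverse events to a human reviewer. The sketch below is only an assumption about how such a screen might look. The trigger phrases, the `ScreenResult` type and the `screen_post` function are hypothetical names introduced here, and a keyword match is a stand-in for whatever detection method a real pharmacovigilance process would use.

```python
import re
from dataclasses import dataclass

# Hypothetical trigger phrases; a real list would come from a
# pharmacovigilance team, not from this sketch.
AE_TRIGGERS = ["side effect", "nausea", "rash", "hospitalized", "allergic reaction"]

# One case-insensitive pattern that matches any trigger phrase.
_AE_PATTERN = re.compile("|".join(re.escape(t) for t in AE_TRIGGERS), re.IGNORECASE)


@dataclass
class ScreenResult:
    post_id: str
    needs_review: bool   # True if at least one trigger phrase was found
    matched_terms: list  # sorted, lower-cased trigger phrases found in the post


def screen_post(post_id: str, text: str) -> ScreenResult:
    """Flag a post for human review if it mentions any trigger phrase."""
    matches = sorted({m.group(0).lower() for m in _AE_PATTERN.finditer(text)})
    return ScreenResult(post_id, bool(matches), matches)


flagged = screen_post("p1", "I had severe Nausea and a rash after the second cycle.")
clean = screen_post("p2", "Which forum is best for newly diagnosed patients?")
```

A screen like this only decides which posts a person should read first. It does not decide whether an adverse event actually occurred, and it will miss anything phrased outside the trigger list.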

D Cube’s Approach:

To enable comprehensive and business-consumable results, we combine our domain knowledge and technical expertise in systematically processing unstructured social media data:

| Stage | Domain Knowledge | Technical Expertise |
| --- | --- | --- |
| Web-scraping | For a given disease type, knowledge of which forums are popular among patients for expressing their questions, opinions and experiences | Modular web-scraping components that can easily be customized for various patient forum websites |
| Data Structuring | Understanding of the important data and metadata elements that need to be extracted from patient forums | Structuring the data using the necessary pre-processing steps and storing it in an efficient format that can be used directly for building machine learning models |
| Entity Extraction | Knowledge of the entities that are relevant for various pharma business stakeholders (e.g., Disease State, Treatment Status, Treatment Experience, etc.) and the ability to define their constituent taxonomies | Ensemble machine learning components to derive highly accurate multi-class and multi-label entities |
| Patient Profiling | Understanding of patient attributes that are suitable for generating targeted patient profiles and the ability to provide business-interpretable insights for the same | A hypothesis-driven approach to variable selection, followed by a rigorous segmentation exercise |
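
To make the Data Structuring and Entity Extraction stages more concrete, here is a minimal sketch of how a scraped post might be cleaned into a structured record and then tagged against a taxonomy. Everything in it is an assumption made for illustration: the `ForumPost` fields, the `TAXONOMY` contents, the sample post and the function names are hypothetical. Keyword matching is used here as a simple stand-in and is not the ensemble machine learning approach described above.

```python
import re
from dataclasses import dataclass, field

# Hypothetical taxonomy: entity -> label -> keywords that suggest the label.
TAXONOMY = {
    "treatment_status": {
        "on_treatment": ["currently on", "started chemo", "taking"],
        "completed": ["finished", "completed"],
    },
    "treatment_experience": {
        "positive": ["helped", "feeling better"],
        "negative": ["fatigue", "did not help"],
    },
}


@dataclass
class ForumPost:
    post_id: str
    author: str
    posted_on: str  # ISO date string, kept as text in this sketch
    text: str
    labels: dict = field(default_factory=dict)  # entity -> list of labels


def clean_text(raw: str) -> str:
    """Strip HTML tags and collapse runs of whitespace."""
    no_tags = re.sub(r"<[^>]+>", " ", raw)
    return re.sub(r"\s+", " ", no_tags).strip()


def tag_post(post: ForumPost) -> ForumPost:
    """Attach every taxonomy label whose keywords appear in the post text.

    Multi-label: one entity can receive several labels at once.
    """
    lowered = post.text.lower()
    for entity, label_map in TAXONOMY.items():
        hits = [label for label, keywords in label_map.items()
                if any(k in lowered for k in keywords)]
        if hits:
            post.labels[entity] = hits
    return post


# Made-up scraped record, for illustration only.
raw = {
    "id": "42",
    "user": "anon_17",
    "date": "2018-03-05",
    "html": "<p>I finished  radiation last month.<br>The fatigue was rough but it helped.</p>",
}
post = tag_post(ForumPost(raw["id"], raw["user"], raw["date"], clean_text(raw["html"])))
```

Storing labels as a list per entity keeps the record multi-label, so a single post can carry both a positive and a negative treatment experience. The resulting records could then feed the Patient Profiling stage as candidate variables for segmentation.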

Contact us to request a demo and find out how D Cube can help you derive patient-level insights from cancer support forums.


D Cube Analytics is an Integrated Data Sciences company focused on extracting transformational insights from syndicated, real world and digital data to increase revenue realization, avert revenue loss, enhance internal productivity and improve end user experience for global Pharmaceutical organizations.

D Cube is pioneering a Digital Transformation wave within BioPharma by leveraging new-age tools and methodologies like Artificial Intelligence, Machine Learning and Robotic Process Automation to greatly improve workforce productivity and significantly enhance speed to insight. Through this new-age, product-based approach to delivering analytics, we greatly reduce the cost and complexity of deployments and provide measurable value across multiple business functions.

Find out how D Cube can help you to elevate your market access intelligence and develop rigorous strategies that enable success in the market, throughout the product life cycle.


Reach out to us at info@dcubeanalytics.com for information and questions.
