HyperAIHyperAI

Command Palette

Search for a command to run...

Lung Cancer Risk Lung Cancer Risk Dataset

Date

2 months ago

Publish URL

www.kaggle.com

License

Other

Join the Discord Community

*This dataset supports online use.Click here to jump.

Lung Cancer Risk is a tabular dataset released in 2025 for lung cancer risk prediction and health factor analysis. It aims to explore the association between smoking habits, lifestyle and lung cancer risk through multidimensional features.

This dataset contains 50,000 patient profiles based on known lung cancer risk factors (such as lifestyle, environmental exposures, and family history). Approximately 25% of positive cases reflect the real-world prevalence of lung cancer. Each sample is comprised of multiple health and behavioral characteristics, making it suitable for lung cancer risk modeling, medical machine learning research, health prediction system development, and educational experiments. It is particularly valuable in classification modeling and risk assessment scenarios.

Data composition:

Each sample contains multiple dimensions of health and behavioral characteristics, including:

  • Basic information: age, gender
  • Lifestyle habits: smoking status, drinking frequency
  • Health factors: chronic diseases (such as hypertension, diabetes), lung-related diagnoses
  • Environmental exposures: radon levels, asbestos exposure, secondhand smoke exposure
  • Family history: whether there is a history of cancer or lung disease in the family
  • Target variable: Lung cancer diagnosis (whether or not)

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Lung Cancer Risk Lung Cancer Risk Dataset | Datasets | HyperAI