# Science & Health Experts - Deep Knowledge Base
## Neuroscientist (701)
### Brain Science Fundamentals
**Brain Regions and Functions**
```
CEREBRAL CORTEX
├── Frontal Lobe
│ ├── Executive function, planning
│ ├── Decision making
│ ├── Motor control (primary motor cortex)
│ └── Speech production (Broca's area)
│
├── Parietal Lobe
│ ├── Sensory processing
│ ├── Spatial awareness
│ └── Mathematics
│
├── Temporal Lobe
│ ├── Auditory processing
│ ├── Memory (hippocampus)
│ ├── Language comprehension (Wernicke's area)
│ └── Emotion (amygdala)
│
└── Occipital Lobe
└── Visual processing
SUBCORTICAL STRUCTURES
├── Hippocampus: Memory formation
├── Amygdala: Emotion, fear response
├── Thalamus: Sensory relay
├── Hypothalamus: Homeostasis, hormones
├── Basal Ganglia: Movement, habits
└── Cerebellum: Coordination, balance
```
**Neuroplasticity Principles**
```
HEBBIAN LEARNING
"Neurons that fire together, wire together"
Mechanisms:
1. Long-Term Potentiation (LTP)
- Strengthens frequently used connections
- Basis of learning and memory
2. Long-Term Depression (LTD)
- Weakens rarely used connections
- Pruning unnecessary pathways
3. Neurogenesis
- New neuron creation (hippocampus)
- Enhanced by exercise, learning
4. Synaptogenesis
- New synapse formation
- Peaks in early development
PROMOTING NEUROPLASTICITY:
✓ Novel experiences
✓ Physical exercise
✓ Mental challenges
✓ Quality sleep
✓ Social interaction
✓ Stress management
```
### Cognitive Enhancement
**Memory Optimization**
```
ENCODING STRATEGIES
├── Elaborative encoding (make meaningful)
├── Visual imagery
├── Chunking information
├── Self-reference effect
└── Generation effect (create vs. receive)
CONSOLIDATION
├── Sleep (especially deep sleep, REM)
├── Spaced repetition
├── Active recall
└── Emotional significance
RETRIEVAL
├── Context-dependent memory
├── State-dependent memory
├── Cue-based recall
└── Testing effect
```
**Cognitive Load Theory**
```
Working Memory Capacity: 4±1 items
INTRINSIC LOAD
├── Inherent complexity of material
└── Cannot be reduced, only managed
EXTRANEOUS LOAD (Reduce this)
├── Poor instructional design
├── Unnecessary information
└── Split attention
GERMANE LOAD (Increase this)
├── Schema construction
├── Meaningful processing
└── Deep understanding
STRATEGIES:
- Segment complex information
- Remove redundancy
- Integrate related information
- Use worked examples
- Provide scaffolding
```
---
## Data Scientist/Statistician (401/712)
### Statistical Analysis Framework
**Hypothesis Testing**
```
PROCESS:
1. State hypotheses
H₀: Null hypothesis (no effect)
H₁: Alternative hypothesis (effect exists)
2. Choose significance level (α)
Typically α = 0.05
3. Select appropriate test
(see decision tree below)
4. Calculate test statistic
5. Compare p-value to α
p < α → Reject H₀
p ≥ α → Fail to reject H₀
6. Draw conclusions
COMMON ERRORS:
Type I Error (False Positive): Reject H₀ when true
Type II Error (False Negative): Fail to reject H₀ when false
```
**Statistical Test Selection**
```
ONE VARIABLE:
├── Categorical → Chi-square goodness of fit
└── Continuous → One-sample t-test
TWO VARIABLES:
├── Both Categorical → Chi-square independence
├── Both Continuous → Correlation/Regression
└── One each:
├── 2 groups → Independent t-test
├── 2 groups (paired) → Paired t-test
└── 3+ groups → ANOVA
ASSUMPTIONS CHECK:
├── Normality: Shapiro-Wilk test, Q-Q plot
├── Homogeneity: Levene's test
├── Independence: Study design
└── Sample size: n ≥ 30 for CLT
```
**Effect Size Interpretation**
```
COHEN'S D (Mean differences)
├── Small: d = 0.2
├── Medium: d = 0.5
└── Large: d = 0.8
PEARSON'S R (Correlation)
├── Small: r = 0.1
├── Medium: r = 0.3
└── Large: r = 0.5
ETA SQUARED (ANOVA)
├── Small: η² = 0.01
├── Medium: η² = 0.06
└── Large: η² = 0.14
R-SQUARED (Regression)
Proportion of variance explained
0.5 = 50% of variance explained
```
---
## Climate Scientist (704)
### Climate Science Fundamentals
**Greenhouse Effect**
```
NATURAL GREENHOUSE EFFECT:
Solar radiation → Earth's surface
Surface emits infrared radiation
Greenhouse gases absorb and re-emit
Average temperature: 15°C (vs -18°C without)
ENHANCED GREENHOUSE EFFECT:
Human activities increase GHG concentrations
More heat trapped → Global warming
MAJOR GREENHOUSE GASES:
Gas GWP* Concentration Main Source
CO₂ 1 420 ppm Fossil fuels
CH₄ 28 1900 ppb Agriculture, gas
N₂O 265 335 ppb Agriculture
HFCs 1000+ ppt levels Refrigerants
*GWP = Global Warming Potential (100-year)
```
**Climate Metrics**
```
GLOBAL INDICATORS:
├── Global mean temperature: +1.1°C since pre-industrial
├── Sea level rise: ~3.4 mm/year
├── Arctic sea ice: -13% per decade
├── Ocean acidification: -0.1 pH since 1800
CARBON BUDGET (1.5°C target):
├── Total remaining: ~400 GtCO₂
├── Current emissions: ~40 GtCO₂/year
├── Time remaining: ~10 years at current rate
TIPPING POINTS:
├── Arctic ice-free summers: 1.5-2°C
├── Greenland ice sheet: 1.5-3°C
├── Amazon dieback: 2-4°C
├── Permafrost collapse: 2-4°C
```
### Sustainability Metrics
**Carbon Footprint Analysis**
```
AVERAGE FOOTPRINTS (tCO₂e/year):
├── Global average: 4.5
├── US average: 16
├── EU average: 8
├── India average: 2
├── Target for 2°C: 2
BY CATEGORY (typical developed):
├── Transportation: 30%
├── Housing/Energy: 25%
├── Food: 20%
├── Goods/Services: 25%
REDUCTION STRATEGIES:
├── Plant-based diet: -0.8 tCO₂e
├── Avoid one flight: -1.6 tCO₂e
├── Electric vehicle: -2.4 tCO₂e
├── Renewable energy: -1.5 tCO₂e
```
---
## Epidemiologist (709)
### Disease Surveillance
**Key Epidemiological Measures**
```
INCIDENCE RATE
New cases / Population at risk × Time
Example: 50 new cases / 10,000 people / year = 0.5%
PREVALENCE
Total cases / Total population
Point prevalence: At specific time
Period prevalence: Over time period
MORTALITY RATE
Deaths / Population × Time
Crude mortality: All causes
Cause-specific: Specific disease
CASE FATALITY RATE
Deaths / Confirmed cases
Measures severity (not spread)
REPRODUCTION NUMBER (R₀)
Average secondary infections from one case
R₀ > 1: Epidemic grows
R₀ < 1: Epidemic declines
R₀ = 1: Endemic equilibrium
```
**Outbreak Investigation**
```
STEPS:
1. Confirm diagnosis/outbreak
2. Define case definition
3. Count cases (epidemic curve)
4. Orient data (time, place, person)
5. Develop hypotheses
6. Test hypotheses (analytic study)
7. Implement control measures
8. Communicate findings
EPIDEMIC CURVE PATTERNS:
Point source: Sharp peak
Continuous: Plateau
Propagated: Multiple peaks
Mixed: Combination
```
---
## Biotechnologist (703)
### Genetic Engineering Techniques
**CRISPR-Cas9 System**
```
COMPONENTS:
├── Cas9: Molecular scissors (endonuclease)
├── sgRNA: Guide to target location
└── PAM: Protospacer adjacent motif (NGG)
MECHANISM:
1. Design sgRNA complementary to target
2. sgRNA-Cas9 complex scans DNA
3. PAM recognition → DNA binding
4. Double-strand break at target
5. Repair pathway activates:
├── NHEJ: Random insertions/deletions (knockout)
└── HDR: Precise editing (with template)
APPLICATIONS:
├── Gene knockout studies
├── Disease model creation
├── Therapeutic gene editing
├── Agricultural improvements
├── Diagnostic tools
```
**PCR (Polymerase Chain Reaction)**
```
CYCLE STEPS:
1. DENATURATION (94-98°C)
Double-stranded DNA separates
2. ANNEALING (50-65°C)
Primers bind to target sequences
3. EXTENSION (72°C)
DNA polymerase synthesizes new strands
AMPLIFICATION:
Cycle 1: 2 copies
Cycle 10: ~1,000 copies
Cycle 20: ~1,000,000 copies
Cycle 30: ~1,000,000,000 copies
VARIATIONS:
├── RT-PCR: RNA to cDNA first
├── qPCR: Real-time quantification
├── Digital PCR: Absolute quantification
└── Multiplex: Multiple targets
```
---
## Pharmacologist (711)
### Drug Development Process
**Clinical Trial Phases**
```
PRECLINICAL
├── In vitro studies (cell cultures)
├── In vivo studies (animal models)
├── Safety and efficacy assessment
└── Duration: 3-6 years
PHASE I (Safety)
├── Participants: 20-100 healthy volunteers
├── Goals: Safety, dosing, pharmacokinetics
├── Duration: Several months
└── Success rate: ~70%
PHASE II (Efficacy)
├── Participants: 100-500 patients
├── Goals: Effectiveness, side effects
├── Duration: Several months to 2 years
└── Success rate: ~33%
PHASE III (Confirmation)
├── Participants: 1,000-5,000 patients
├── Goals: Confirm efficacy, monitor adverse events
├── Duration: 1-4 years
└── Success rate: ~25-30%
FDA REVIEW
├── NDA/BLA submission
├── Duration: 6-12 months
PHASE IV (Post-Market)
├── Ongoing safety monitoring
├── Real-world effectiveness
└── Long-term effects
```
**Pharmacokinetics (ADME)**
```
ABSORPTION
├── Route of administration
├── Bioavailability
├── First-pass metabolism
DISTRIBUTION
├── Blood-brain barrier
├── Protein binding
├── Volume of distribution
METABOLISM
├── Phase I: Oxidation, reduction, hydrolysis
├── Phase II: Conjugation
├── CYP450 enzymes
├── Drug interactions
ELIMINATION
├── Renal excretion
├── Biliary excretion
├── Half-life (t½)
├── Clearance rate
```
---
## Research Methodology
### Scientific Method
```
OBSERVATION
What pattern or phenomenon do you notice?
↓
QUESTION
What specific question does this raise?
↓
HYPOTHESIS
What is your testable explanation?
↓
EXPERIMENT
How will you test the hypothesis?
├── Control group
├── Experimental group
├── Variables (IV, DV, controlled)
└── Sample size and selection
↓
ANALYSIS
What do the data show?
├── Statistical tests
├── Visualizations
└── Interpretation
↓
CONCLUSION
Does evidence support hypothesis?
├── Accept/reject hypothesis
├── Limitations
└── Future directions
↓
PEER REVIEW
External validation of findings
```
### Research Design Types
```
EXPERIMENTAL
├── True experiment (randomized)
├── Quasi-experiment (non-randomized)
├── Controls for confounds
└── Can establish causation
OBSERVATIONAL
├── Cohort (follow over time)
├── Case-control (compare groups)
├── Cross-sectional (single point)
└── Cannot establish causation
QUALITATIVE
├── Interviews
├── Focus groups
├── Ethnography
├── Content analysis
└── Explores meaning and experience
MIXED METHODS
├── Sequential: One method informs another
├── Concurrent: Methods used simultaneously
└── Triangulation for validity
```