CSY201 Week 04 - Practice SIEM-style analysis before moving to reading resources.

Opening Framing: From Logs to Intelligence

Last week you analyzed logs with command-line tools. Powerful, but it doesn't scale. When you have billions of events across thousands of systems, you need a platform that can aggregate, normalize, correlate, and alert—automatically and in real-time.

That platform is a SIEM (Security Information and Event Management). SIEMs are the central nervous system of security operations. They collect data from everywhere, make it searchable, correlate events across sources, and generate alerts when threats are detected.

This week covers SIEM architecture, core capabilities, query languages, and how to build effective detections. You'll work with real SIEM concepts that apply to any platform.

Key insight: A SIEM is only as good as what you put in and how you use it. Bad data and poor rules produce noise. Good data and thoughtful rules produce actionable intelligence.

1) SIEM Architecture

Understanding SIEM components helps you use them effectively:

SIEM Core Components:

┌─────────────────────────────────────────────────────────┐
│                         SIEM                            │
│  ┌─────────┐  ┌─────────┐  ┌─────────┐  ┌─────────┐   │
│  │ Collect │→ │ Parse/  │→ │ Index/  │→ │ Analyze │   │
│  │         │  │Normalize│  │ Store   │  │ /Alert  │   │
│  └─────────┘  └─────────┘  └─────────┘  └─────────┘   │
│       ↑                                      ↓         │
│  Log Sources                            Dashboards     │
│  - Firewalls                            Alerts         │
│  - Servers                              Reports        │
│  - Endpoints                            Cases          │
│  - Cloud                                               │
└─────────────────────────────────────────────────────────┘

Data flow:
1. Collection: Ingest logs via agents, syslog, APIs
2. Parsing: Extract fields, normalize format
3. Enrichment: Add context (geo, asset, user info)
4. Indexing: Store for fast retrieval
5. Correlation: Match patterns across events
6. Alerting: Notify on detected threats
7. Visualization: Dashboards and reports

Popular SIEM Platforms:

Commercial:
- Splunk Enterprise Security
- Microsoft Sentinel
- IBM QRadar
- LogRhythm
- Exabeam
- Securonix

Open Source / Free Tier:
- Elastic Security (ELK Stack)
- Wazuh
- Graylog
- OSSIM (AlienVault)

Cloud-Native:
- Microsoft Sentinel (Azure)
- Chronicle (Google)
- Amazon Security Lake + OpenSearch

Key differentiators:
- Query language and ease of use
- Correlation capabilities
- Integration ecosystem
- Pricing model (data volume, users, features)
- Cloud vs on-premises deployment

Deployment Considerations:

Sizing factors:
- Events per second (EPS)
- Data retention requirements
- Number of log sources
- Search performance needs
- Number of concurrent users

Architecture patterns:

Small (< 5,000 EPS):
- Single server or small cluster
- 30-90 day hot storage

Medium (5,000-50,000 EPS):
- Distributed collection
- Search head cluster
- Tiered storage (hot/warm/cold)

Large (> 50,000 EPS):
- Globally distributed
- Multiple indexer clusters
- Data lake integration
- Heavy use of summarization

Key insight: SIEM architecture decisions affect everything— search speed, storage costs, and detection capability. Plan carefully before deployment.

2) SIEM Query Languages

Every SIEM has a query language. Learn the concepts, adapt to any syntax:

Splunk SPL (Search Processing Language):

# Basic search
index=security sourcetype=WinEventLog EventCode=4625

# Filter and select fields
index=security EventCode=4625 
| table _time, user, src_ip, dest

# Count by field
index=security EventCode=4625 
| stats count by src_ip 
| sort -count

# Time-based analysis
index=security EventCode=4625 
| timechart span=1h count by src_ip

# Multiple conditions
index=security (EventCode=4625 OR EventCode=4624) 
| stats count by EventCode, user

# Subsearch correlation
index=security EventCode=4625 
| stats count by src_ip 
| where count > 10 
| map search="search index=security EventCode=4624 src_ip=$src_ip$"

Elastic/KQL (Kibana Query Language):

# Basic search
event.code: 4625

# Field search
event.code: 4625 AND source.ip: 192.168.1.100

# Wildcards
user.name: admin*

# Range queries
@timestamp >= "2024-01-15" AND @timestamp < "2024-01-16"

# Boolean logic
(event.code: 4625 OR event.code: 4624) AND user.name: jsmith

# Elasticsearch DSL for complex queries
{
  "query": {
    "bool": {
      "must": [
        { "match": { "event.code": "4625" } }
      ],
      "filter": [
        { "range": { "@timestamp": { "gte": "now-24h" } } }
      ]
    }
  },
  "aggs": {
    "by_ip": { "terms": { "field": "source.ip" } }
  }
}

Microsoft Sentinel KQL (Kusto):

// Basic query
SecurityEvent
| where EventID == 4625

// Filter and project
SecurityEvent
| where EventID == 4625
| project TimeGenerated, Account, IpAddress, Computer

// Aggregation
SecurityEvent
| where EventID == 4625
| summarize count() by IpAddress
| order by count_ desc

// Time analysis
SecurityEvent
| where EventID == 4625
| summarize count() by bin(TimeGenerated, 1h), IpAddress
| render timechart

// Join tables
SecurityEvent
| where EventID == 4625
| join kind=inner (
    SecurityEvent | where EventID == 4624
) on IpAddress, Account

Key insight: Query language fluency is essential for SOC analysts. Practice until searching feels natural—speed matters during incidents.

3) Detection Rules and Correlation

SIEM value comes from detection rules that surface threats:

Detection Rule Components:

1. Data Source
   - What logs does this rule need?
   - Are they being collected?

2. Logic/Query
   - What pattern indicates the threat?
   - How specific vs. broad?

3. Threshold
   - Single event or multiple?
   - Time window for correlation?

4. Severity
   - How critical if this fires?
   - Drives response priority

5. Response
   - What action when triggered?
   - Who gets notified?

Detection Rule Examples:

# Brute Force Detection (Splunk)
index=security EventCode=4625
| stats count as failures by src_ip, user
| where failures > 5
| alert severity=medium

# Impossible Travel (Sentinel KQL)
SigninLogs
| summarize Locations=make_set(Location), 
            Times=make_list(TimeGenerated) by UserPrincipalName
| where array_length(Locations) > 1
// Additional logic to calculate travel time vs distance

# Suspicious Process Execution (Sigma - generic format)
title: Suspicious PowerShell Download
logsource:
    product: windows
    service: powershell
detection:
    selection:
        CommandLine|contains|all:
            - 'IEX'
            - 'WebClient'
            - 'DownloadString'
    condition: selection
level: high

Correlation Rule Types:

Single Event:
- One event triggers alert
- Example: Known malware hash detected
- Low false positive, limited context

Threshold:
- Count exceeds limit in time window
- Example: >10 failed logins in 5 minutes
- Common for brute force, scanning

Sequence:
- Events occur in specific order
- Example: Login failure → success → privilege escalation
- Powerful for attack chains

Anomaly:
- Deviation from baseline
- Example: User accessing unusual systems
- Requires learning period, tuning

Absence:
- Expected event doesn't occur
- Example: No heartbeat from critical system
- Useful for availability monitoring

Sigma: Universal Detection Format:

# Sigma rules are platform-agnostic
# Convert to Splunk, Elastic, Sentinel, etc.

title: Mimikatz Command Line
status: experimental
description: Detects Mimikatz execution
logsource:
    category: process_creation
    product: windows
detection:
    selection:
        CommandLine|contains:
            - 'sekurlsa::'
            - 'kerberos::'
            - 'crypto::'
            - 'lsadump::'
    condition: selection
falsepositives:
    - Security tools that use similar patterns
level: critical
tags:
    - attack.credential_access
    - attack.t1003

# Convert with sigmac tool:
sigmac -t splunk mimikatz.yml
sigmac -t es-qs mimikatz.yml

Key insight: Good detection rules are specific enough to catch real threats, but not so narrow they miss variations. Balance is an art developed through tuning.

4) SIEM Operations Best Practices

Effective SIEM use requires disciplined operations:

Data Quality:

Ensure quality data:

Collection:
□ All critical sources sending?
□ No gaps in data flow?
□ Timestamps accurate (NTP sync)?

Parsing:
□ Fields extracted correctly?
□ No parsing failures?
□ Normalized to common schema?

Enrichment:
□ Asset information accurate?
□ User context available?
□ Threat intel integrated?

Monitoring data health:
- Track EPS by source
- Alert on collection failures
- Regular data quality audits

Rule Management:

Rule lifecycle:

Development:
1. Identify detection gap
2. Research attack technique
3. Write initial rule
4. Test against historical data

Deployment:
1. Enable in detection-only mode
2. Monitor for false positives
3. Tune thresholds/logic
4. Promote to production

Maintenance:
1. Regular review of rule performance
2. Tune based on analyst feedback
3. Update for new attack variants
4. Deprecate obsolete rules

Documentation:
- What the rule detects
- Why it matters
- Expected false positives
- Investigation steps

Performance Optimization:

Query optimization:

Slow:
index=* | search error

Fast:
index=application sourcetype=app_log error

Tips:
- Specify index and sourcetype
- Filter early, aggregate late
- Use time ranges
- Avoid wildcards at start of terms
- Use summary indexes for dashboards

Resource management:
- Schedule heavy searches off-peak
- Use data models for common queries
- Archive old data to cheaper storage
- Monitor search concurrency

Key insight: SIEM is an ongoing investment. Without continuous tuning and maintenance, it becomes an expensive log warehouse instead of a detection platform.

5) Building Effective Dashboards

Dashboards transform data into actionable visibility:

Dashboard Types:

Operational Dashboard:
- Real-time metrics
- Alert queue status
- Current incidents
- Used by: SOC analysts, shift leads

Executive Dashboard:
- High-level KPIs
- Trends over time
- Risk posture
- Used by: CISO, management

Investigation Dashboard:
- Deep-dive views
- Drill-down capability
- Used by: Tier 2/3 analysts

Threat Dashboard:
- Specific threat monitoring
- Campaign tracking
- IOC hits
- Used by: Threat intel team

Effective Dashboard Design:

SOC Operations Dashboard:

┌─────────────────────────────────────────────────────┐
│  Open Alerts: 47    │  Critical: 3   │  MTTD: 4.2h │
├─────────────────────┴──────────────────────────────┤
│                                                     │
│  [Alert Volume - Last 24h - Time Chart]            │
│                                                     │
├─────────────────────┬───────────────────────────────┤
│ Top Alert Types     │  Top Source IPs               │
│ 1. Failed Login 23  │  1. 192.168.1.50  15         │
│ 2. Malware Det. 12  │  2. 10.0.0.25     12         │
│ 3. Policy Viol. 8   │  3. 203.0.113.5   8          │
├─────────────────────┴───────────────────────────────┤
│ Recent Critical Alerts                              │
│ • 10:23 - Ransomware detected - SERVER01           │
│ • 10:15 - Data exfil alert - WS-FINANCE-42         │
│ • 09:58 - Brute force success - DC01               │
└─────────────────────────────────────────────────────┘

Design principles:
- Most important info at top
- Use color purposefully (red=critical)
- Enable drill-down to details
- Refresh appropriately (not too fast)
- Avoid chart junk—every element should inform

Key insight: A good dashboard answers questions at a glance. If analysts have to think hard to interpret it, redesign it.

Real-World Context: SIEM in SOC Operations

SIEM is the SOC's primary tool:

Daily Operations: Analysts start their day in the SIEM—checking the alert queue, reviewing overnight activity, searching for anomalies. The SIEM is where investigations begin and evidence is gathered.

Incident Response: During incidents, the SIEM provides the timeline. What systems were accessed? What data was touched? When did the attack start? Incident commanders rely on SIEM queries to understand scope and impact.

Compliance: Auditors want evidence of security monitoring. SIEM provides logs, alerts, and reports that demonstrate due diligence. Many compliance frameworks require SIEM or equivalent.

MITRE ATT&CK Integration:

Detection Coverage: Map rules to ATT&CK techniques
Gap Analysis: Identify techniques without detection
Threat Intel: Search for technique-specific IOCs

Key insight: SIEM mastery is career-defining for SOC analysts. The analyst who can quickly find answers in the SIEM is invaluable during incidents.

Guided Lab: SIEM Query Practice

Let's practice SIEM queries using common scenarios. These exercises use generic syntax—adapt to your platform.

Step 1: Basic Event Search

# Scenario: Find all failed Windows logins in last 24 hours

# Splunk:
index=windows EventCode=4625 earliest=-24h

# Elastic KQL:
event.code: 4625 AND @timestamp >= now-24h

# Sentinel KQL:
SecurityEvent
| where TimeGenerated > ago(24h)
| where EventID == 4625

Step 2: Aggregation Query

# Scenario: Count failed logins by source IP

# Splunk:
index=windows EventCode=4625 earliest=-24h
| stats count by src_ip
| sort -count

# Elastic:
POST /security-*/_search
{
  "query": { "match": { "event.code": "4625" } },
  "aggs": { "by_ip": { "terms": { "field": "source.ip" } } }
}

# Sentinel:
SecurityEvent
| where TimeGenerated > ago(24h)
| where EventID == 4625
| summarize count() by IpAddress
| order by count_ desc

Step 3: Correlation Query

# Scenario: Find IPs with failed logins followed by success

# Splunk:
index=windows EventCode=4625 earliest=-24h
| stats count as failures by src_ip
| where failures > 3
| join src_ip [search index=windows EventCode=4624 earliest=-24h]

# Sentinel:
let failed = SecurityEvent
| where TimeGenerated > ago(24h) and EventID == 4625
| summarize FailCount=count() by IpAddress
| where FailCount > 3;
let success = SecurityEvent
| where TimeGenerated > ago(24h) and EventID == 4624;
failed | join kind=inner success on IpAddress

Step 4: Time-Based Analysis

# Scenario: Show login failures over time

# Splunk:
index=windows EventCode=4625 earliest=-7d
| timechart span=1h count

# Sentinel:
SecurityEvent
| where TimeGenerated > ago(7d)
| where EventID == 4625
| summarize count() by bin(TimeGenerated, 1h)
| render timechart

Step 5: Complex Hunt Query

# Scenario: Find PowerShell downloading content

# Splunk:
index=windows sourcetype=WinEventLog:PowerShell
| search "*WebClient*" OR "*DownloadString*" OR "*DownloadFile*"
| table _time, host, user, Message

# Sentinel:
SecurityEvent
| where EventID == 4104  // Script block logging
| where EventData contains "WebClient" or 
        EventData contains "DownloadString"
| project TimeGenerated, Computer, Account, EventData

Step 6: Reflection (mandatory)

Which query type (basic, aggregation, correlation) is most useful for your daily work?
How would you optimize a slow-running query?
What additional data would make these queries more effective?
How do these queries map to detection rules?

Week 4 Outcome Check

By the end of this week, you should be able to:

Explain SIEM architecture and components
Write basic queries in at least one SIEM platform
Understand detection rule types and structure
Apply SIEM operations best practices
Design effective security dashboards
Use Sigma for platform-agnostic detection

Next week: Alert Triage and Investigation—turning SIEM alerts into investigated incidents.

🎯 Hands-On Labs (Free & Essential)

Practice SIEM-style analysis before moving to reading resources.

🎮 TryHackMe: Intro to SOC (SIEM Concepts)

What you'll do: Explore how alerts are generated and triaged from centralized logs.
Why it matters: SIEM is the SOC's primary visibility layer.
Time estimate: 1-1.5 hours

Start TryHackMe Intro to SOC →

📝 Lab Exercise: SIEM Query Worksheet

Task: Write five detection queries (auth failures, rare processes, admin logins).
Deliverable: Query list with a one-line detection goal for each.
Why it matters: Good detections are clear, scoped, and testable.
Time estimate: 60-90 minutes

🏁 PicoCTF Practice: Forensics (Alert Context)

What you'll do: Solve beginner challenges that mimic alert investigation context.
Why it matters: SIEM alerts still require evidence validation.
Time estimate: 1-2 hours

Start PicoCTF Forensics →

🛡️ Lab: Kernel Module Audit

What you'll do: Audit loaded kernel modules and identify unsigned or suspicious modules.
Deliverable: Module inventory + notes on expected vs anomalous drivers.
Why it matters: Kernel modules can hide rootkits and persistence.
Time estimate: 60-90 minutes

💡 Lab Tip: Start with high-signal detections (auth anomalies, admin activity) before broad queries.

🛡️ Kernel Security Auditing

SIEM visibility improves when you understand the kernel layer. Kernel modules and drivers are high-risk because they run with full privileges and can hide activity.

Kernel audit checklist:
- Enumerate loaded modules/drivers
- Verify module signatures and provenance
- Monitor for unsigned or newly loaded modules
- Baseline expected kernel extensions

📚 Building on CSY102: Kernel concepts and process isolation; apply to SOC baselining.

Resources

Complete the required resources to build your foundation.

Splunk Search Tutorial · 60-90 min · 50 XP · Resource ID: csy201_w4_r1 (Required)
Sigma Rules Repository · 45-60 min · 50 XP · Resource ID: csy201_w4_r2 (Required)
KQL (Kusto) Reference · Reference · 25 XP · Resource ID: csy201_w4_r3 (Optional)

Lab: Detection Rule Development

Goal: Develop, test, and document detection rules for common attack techniques.

Part 1: Analyze Attack Technique

Select one ATT&CK technique:
- T1059.001 - PowerShell
- T1003.001 - LSASS Memory
- T1566.001 - Spearphishing Attachment
Research the technique:
- How does it work?
- What artifacts does it leave?
- What log sources capture it?

Part 2: Write Detection Rule

Write the rule in Sigma format
Include:
- Title and description
- Log source specification
- Detection logic
- False positive notes
- ATT&CK mapping
Convert to your SIEM's query language

Part 3: Test the Rule

If possible, generate test data (in lab environment)
Run the rule against sample data
Document:
- True positive rate
- False positive rate
- Tuning recommendations

Part 4: Create Response Playbook

Document investigation steps for this alert
Include:
- Initial triage questions
- Additional queries to run
- Escalation criteria
- Containment actions

Deliverable (submit):

Technique research summary
Sigma rule file
SIEM query conversion
Test results (if applicable)
Response playbook for the detection

Checkpoint Questions

What are the main components of SIEM architecture?
What is the difference between a threshold rule and a sequence rule?
What is Sigma and why is it useful?
How would you optimize a slow SIEM query?
What makes a good SOC operations dashboard?
Why is data quality important for SIEM effectiveness?

Week 04 Quiz

Test your understanding of SIEM architecture, queries, and detection rules.

Format: 10 multiple-choice questions. Passing score: 70%. Time: Untimed.

Take Quiz

Weekly Reflection

Reflection Prompt (200-300 words):

This week you learned SIEM fundamentals—the platform that powers modern security operations. You practiced queries, explored detection rules, and considered operational best practices.

Reflect on these questions:

SIEM platforms are expensive and complex. What justifies this investment? When might simpler solutions suffice?
Detection rules require constant tuning. How would you prioritize which rules to develop and maintain?
Many organizations have SIEMs but don't use them effectively. What do you think causes this gap?
How does SIEM query skill compare to command-line analysis? When would you use each approach?

A strong reflection will consider both the power and the challenges of SIEM-based security operations.

Verified Resources & Videos

Splunk Free Training: Splunk Fundamentals
Elastic SIEM: Elastic Security Guide
Detection Engineering: Awesome Detection Engineering

SIEM mastery takes time—these platforms are deep. Focus on query fundamentals first, then build toward detection engineering. The ability to quickly find answers in your SIEM is one of the most valuable SOC skills. Next week: using those skills for alert triage and investigation.