fused layernorm #1105
Conversation
yang commented on Dec 24, 2023
- Add simple util for timings
- Add fused layernorm kernel from Megatron
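For context, a minimal sketch of what a CUDA-event-based timing helper might look like is below. This is purely illustrative (assuming PyTorch and a CUDA device); it is not the actual util added in this PR, and all names are hypothetical.

```python
# Hypothetical sketch of a simple CUDA-event timing helper; not the PR's actual util.
import torch


class CudaTimer:
    """Times a code region on the GPU using CUDA events."""

    def __init__(self):
        self.start_event = torch.cuda.Event(enable_timing=True)
        self.end_event = torch.cuda.Event(enable_timing=True)
        self.elapsed_ms = None

    def __enter__(self):
        self.start_event.record()
        return self

    def __exit__(self, *exc):
        self.end_event.record()
        torch.cuda.synchronize()  # wait for queued kernels so elapsed_time is valid
        self.elapsed_ms = self.start_event.elapsed_time(self.end_event)


# Usage: time one forward pass of a (non-fused) layer norm.
layer_norm = torch.nn.LayerNorm(4096).cuda()
x = torch.randn(8, 2048, 4096, device="cuda")
with CudaTimer() as t:
    _ = layer_norm(x)
print(f"layer norm forward: {t.elapsed_ms:.3f} ms")
```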
Force-pushed from 6af218c to e9eed53
Howdy! Upon testing, we found no speedup, and an eventual slowdown, when training a 1-3B model for 320 iterations. You can see my test here.
If anything, it looks like the baseline is the one with anomalous behavior here. Did you double- and triple-check by running it multiple times? How sure are you that something weird didn't just happen by magic to cause the baseline to speed up?
@StellaAthena I agree, so I ran some extra tests, still available from the link above. Within the run-to-run variance, I see no difference between fused layer norm on and off.
I looked over these results with @jahatef and agree the difference falls within run-to-run variance. I'm going to run some CUDA profiling to check whether the layernorm kernel reduces data movement, and will merge if so. The rationale is that as newer GPUs spend progressively less time on GEMMs, data movement becomes increasingly critical.
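As a rough illustration of that kind of check, the sketch below uses `torch.profiler` to compare per-kernel CUDA time and memory activity for a layer norm module. It is only an assumed setup, not the actual profiling run behind this decision, and per-kernel DRAM traffic would typically be read from a dedicated profiler such as Nsight Compute rather than from this table.

```python
# Hypothetical profiling sketch; not the actual setup used for this PR's merge decision.
import torch
from torch.profiler import profile, ProfilerActivity


def profile_layer_norm(module, x, label):
    # Warm up so one-time allocations don't pollute the trace.
    for _ in range(3):
        module(x)
    torch.cuda.synchronize()
    with profile(activities=[ProfilerActivity.CUDA], profile_memory=True) as prof:
        for _ in range(10):
            module(x)
        torch.cuda.synchronize()
    print(f"--- {label} ---")
    print(prof.key_averages().table(sort_by="cuda_time_total", row_limit=10))


x = torch.randn(8, 2048, 4096, device="cuda")
baseline = torch.nn.LayerNorm(4096).cuda()
profile_layer_norm(baseline, x, "torch.nn.LayerNorm")
# The fused kernel from this PR would be profiled the same way, e.g.:
#   profile_layer_norm(fused_layer_norm_module, x, "fused layer norm")
```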
Yeah, that looks correct to me as well.
I see a meaningful decrease in data movement and have confirmed that accuracy is preserved. Merging. Thanks a ton for this, @yang.
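For reference, an accuracy check like the one mentioned could look roughly like the sketch below, which compares a fused layer norm module against `torch.nn.LayerNorm` on random input. The `fused_ln` argument and the assumption that it exposes the same `weight`/`bias` parameter names are hypothetical; this is not the PR's actual test.

```python
# Hypothetical equivalence check between a fused layer norm and torch.nn.LayerNorm.
import torch


def check_layernorm_equivalence(fused_ln, hidden=4096, atol=1e-5, rtol=1e-5):
    """Compare a fused layer norm module against torch.nn.LayerNorm on random input."""
    baseline = torch.nn.LayerNorm(hidden).cuda()
    fused_ln = fused_ln.cuda()
    # Assumes the fused module uses the same "weight"/"bias" parameter names.
    fused_ln.load_state_dict(baseline.state_dict())
    x = torch.randn(8, 2048, hidden, device="cuda")
    out_ref = baseline(x)
    out_fused = fused_ln(x)
    max_err = (out_ref - out_fused).abs().max().item()
    ok = torch.allclose(out_ref, out_fused, atol=atol, rtol=rtol)
    print(f"max abs error: {max_err:.2e}, allclose: {ok}")
    return ok
```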